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DESCRIPTION 
Human Pr q t 0 ^ Havina Hydrophobic 

n^ins »nri r » H a fi p^nriinn These Proteins 

5 

TPrWNTCAT, FTKTjD 

The present invention relates to human proteins having 
hydrophobic domains, DNAs coding for these proteins, and 
expression vectors for these DNAs as well as eucaryotic 

10 cells expressing these DNAs. The proteins of the present 
invention can be employed as pharmaceuticals or as antigens 
for preparing antibodies against these proteins. The human 
cDNAs of the present invention can be utilized as probes for 
the genetic diagnosis and gene sources for the gene therapy. 

15 Furthermore, the cDNAs can be utilized as gene sources for 
large-scale production of the proteins encoded by these 
cDNAs. Cells into which these genes are introduced to 
express secretory proteins and membrane proteins in large 
amounts can be utilized for detection of the corresponding 

20 receptors and ligands, screening of novel low-molecular 
pharmaceuticals, and so on. 

BACKGROUND ART 

Cells secrete many proteins outside the cells. These 
25 secretory proteins play important roles for the 
proliferation control, the differentiation induction, the 
material transportation, the biological protection, etc. in 
the cells. Different from intracellular proteins, the 
secretory proteins exert their actions outside the cells, 



manner sucn .sl : 
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hidden potentialities as medicines. In fact, a number of 
human secretory proteins such as interferons, interleukins , 
erythropoietin, thrombolytic agents, etc- have been 
currently employed as medicines. In addition, secretory 
proteins other than those described above have been 
undergoing clinical trials to develop as pharmaceuticals. 
Because it has been conceived that the human cells still 
produce many unknown secretory proteins, availability of 
these secretory proteins as well as genes coding for them is 
expected to lead to development of novel pharmaceuticals 
utilizing these proteins . 

On the other hand, membrane proteins play important 
roles, as signal receptors, ion channels, transporters, etc. 
in the material transportation and the information 
15 transmission through the cell membrane. Examples thereof 
include receptors for a variety of cytokines, ion channels 
for the sodium ion, the potassium ion, the chloride ion, 
etc., transporters for saccharides and amino acids, and so 
on, where the genes for many of them have been cloned 
20 already. It has been clarified that abnormalities of these 
membrane proteins are associated with a number of hitherto- 
cryptogenic diseases. Therefore, discovery of a new membrane 
protein is anticipated to lead to elucidation of the causes 
of many diseases, so that isolation of a new gene coding for 
25 the membrane protein has been desired. 

Heretofore, owing to difficulty in the purification 
from human cells, these secretory proteins and membrane 
proteins have been isolated by an approach from the gene 
- i ^n+hoH < <- the po-called expression cloning 



eucaryouc cells t express cDNAs and then screening oi the 
cells secreting, or expressing on the surface of membrane, 
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the objective active protein • However, this method is 
applicable only to cloning of a gene for a protein with a 
known function . 

In general, secretory proteins and membrane proteins 
5 possess at least one hydrophobic domain inside the proteins, 
wherein, after synthesis thereof in the ribosome, this 
domain works as a secretory signal or remains in the 
phospholipid membrane to be trapped in the membrane. 
Accordingly, the evidence of this cDNA for encoding a 
10 secretory protein and a membrane protein is provided by 
determination of the whole base sequence of a full-length 
cDNA followed by detection of highly hydrophobic domain(s) 
in the amino acid sequence of the protein encoded by this 
cDNA. 

15 

OBJECTS OF THE INVENTION 

The main object of the present invention is to provide 
novel human proteins having hydrophobic domains, DNAs coding 
for these proteins, and expression vectors for these DNAs as 
20 well as transformed eucaryotic cells that are capable of 
expressing these DNAs. This object as well as other objects 
and advantages of the present invention will become apparent 
to those skilled in the art from the following description 
with reference to the accompanying drawings. 

25 

BRIEF DESCRIPTION OF DRAWINGS 

Fig. 1 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP01550. 

Fi a . ? illustrates the hydrophobicitv/hvdronhi 1 ici tv 

K.i..q. .: i us trates trie r:y arophob ic 1 1. v / n var oi,;: ; . ;.cit.y 

profile of the protein encoded by clone HP10195. 
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Fig. 4 illustrates the 
profile of the protein encoded 
Fig. 5 illustrates the 
profile of the protein encoded 
5 Fig- 6 illustrates the 

profile of the protein encoded 
Fig. 7 illustrates the 
profile of the protein encoded 
Fig. 8 illustrates the 
10 profile of the protein encoded 
Fig. 9 illustrates the 
profile of the protein encoded 
Fig. 10 illustrates the 
profile of the protein encoded 
15 Fig. 11 illustrates the 

profile of the protein encoded 
Fig. 12 illustrates the 
profile of the protein encoded 
Fig. 13 illustrates the 
20 profile of the protein encoded 
Fig. 14 illustrates the 
profile of the protein encoded 
Fig. 15 illustrates the 
profile of the protein encoded 
25 Fig. 16 illustrates the 

profile of the protein encoded 
Fig . 1 7 illustrates the 
profile of the protein encoded 
Fin 1 o 4 i ]i]?^rate? the 

Pig . 1 y illustrates tine 
profile of the protein encoded 



hydrophobic ity/hydrophilicity 
by clone HP10423. 

hydrophob ic ity / hydrophil ic ity 
by clone HP10506. 

hydrophob ic ity / hydrophil ic ity 
by clone HP10507. 

hydrophob ic ity / hydrophil ic ity 
by clone HP10548. 

hydrophobicity /hydrophil icity 
by clone HP10566. 

hy dr o phob ic ity / hydr oph il ic ity 
by clone HP10567. 

hydr ophob ic ity / hy dr ophi 1 ic ity 
by clone HP10568. 

hydrophobicity /hydrophilicity 
by clone HP01426. 

hydr ophob ic ity / hydrophil ic ity 
by clone HP02515. 

hydrophobicity/hydrophilicity 
by clone HP02575. 

hydrophobic ity / hydrophil ic ity 
by clone HP10357. 

hydrophobicity/hydrophilicity 

by clone HP10447. 

hydr ophob ic ity / hydr oph il ic ity 

by clone HP10477. 

hydrophobic ity / hydr ophil ic ity 

by clone HP10513. 
hvdr ophob ic ity / hydroph i 1 ic i ty 

nydroprioDic ity / nyaropniii c 1. 1. y 
by clone KP10557. 
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Fig. 2 0 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10563. 

Fig. 21 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP01467. 
5 Fig. 22 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP01956. 

Fig. 23 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP0 2545. 

Fig. 24 illustrates the hydrophobicity/hydrophilicity 
10 profile of the protein encoded by clone HP02551. 

Fig. 25 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP02631. 

Fig. 26 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP02632. 
15 Fig. 27 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP10488. 

Fig. 28 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10538. 

Fig. 29 illustrates the hydrophobicity/hydrophilicity 
20 profile of the protein encoded by clone HP10542. 

Fig. 3 0 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10571. 

Fig. 31 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP01470. 
25 Fig. 32 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP02419. 

Fig. 33 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP02631. 

rr ; ~ - - : i i . . c* + >- +- r-r- + b o v -orb i *~ i V ? h vi r ^T"^ "i 1 i ^ i 1" V 

Fig . 3 b i ^lus trates me hydrophobic it y / hydropni 1 uuty 
profile of the protein encoded by clone HP10031. 
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Fig- 3 6 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10530. 

Fig. 3 7 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10541. 
5 Fig . 3 8 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP10550. 

Fig. 39 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10590. 

Fig. 4 0 illustrates the hydrophobicity/hydrophilicity 
10 profile of the protein encoded by clone HP10591. 

Fig . 4 1 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP01462. 

Fig. 42 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP02485. 
15 Fig. 43 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP02798. 

Fig. 44 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10041. 

Fig. 45 illustrates the hydrophobicity/hydrophilicity 
20 profile of the protein encoded by clone HP10246. 

Fig. 46 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10392. 

Fig. 4 7 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10489. 
25 Fig. 4 8 illustrates the hydrophobicity/hydrophilicity 

profile of the protein encoded by clone HP10519. 

Fig. 4 9 illustrates the hydrophobicity/hydrophilicity 
profile of the protein encoded by clone HP10531. 

Fi o . r - ^ i net raf ^ h ^ hvd r oohob i c i t v / h vd roph i 1 icitv 

STTMMARY OF T'Jtt 'MVEKTION 
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As the result of intensive studies, the present 
inventors have been successful in cloning of cDNAs coding 
for proteins having hydrophobic domains from the human full- 
length cDNA bank, thereby completing the present invention. 
5 In other words, the present invention provides human 
proteins having hydrophobic domains, namely proteins 
comprising any of the amino acid sequences represented by 
SEQ ID Nos. 1 to 10, 31 to 40, 61 to 70, 91 to 100, and 121 
to 130. Moreover, the present invention provides DNAs coding 

10 for the above-mentioned proteins, exemplified by cDNAs 
comprising any of the base sequences represented by SEQ ID 
Nos. 11 to 20, 41 to 50, 71 to 80, 101 to 110, and 131 to 
14 0, as well as expression vectors that are capable of 
expressing any of these DNAs by in vitro translation or in 

15 eucaryotic cells and transformed eucaryotic cells that are 
capable of expressing these DNAs and of producing the above- 
mentioned proteins . 

DFTATLED DESCRIPTION OF THE INVENTION 

20 The proteins of the present invention can be obtained, 

for example, by a method for isolation from human organs, 
cell lines, etc., a method for preparation of peptides by 
the chemical synthesis, or a method for production with the 
recombinant DNA technology using the DNAs coding for the 

25 hydrophobic domains of the present invention, among which 
the method for production with the recombinant DNA 
technology is employed preferably. For instance, in vitro 
expression of the proteins can be achieved by preparation of 
-in PNA hv ^ r vi^r^ ranscr i pt i on from a vector havina one of 

translation using trus RNA as a template. Also, .introduction 
of the translated region into a suitable expression vector 



WO 00/05367 PCT/JP99/03929 



by the method known in the art leads to expression of a 
large amount of the encoded protein in prokaryotic cells 
such as Escherichia coli f Bacillus subtilis, etc., and 
eucaryotic cells such as yeasts, insect cells, mammalian 
5 cells, etc. 

In the case where one of the proteins of the present 
invention is produced by expressing the DNA by in vitro 
translation, the protein of the present invention can be 
produced in vitro, when the translated region of this cDNA 

10 is introduced into a vector having an RNA polymerase 
promoter, followed by addition of the vector to an in vitro 
translation system such as a rabbit reticulocyte lysate or a 
wheat germ extract, containing an RNA polymerase 
corresponding to the promoter. RNA polymerase promoters are 

15 exemplified by T7 , T3 , SP6, and the like. The vectors 
containing these RNA polymerase promoters are exemplified by 
pKAl, pCDM8, pT3/T7 18, pT7/3 19, pBluescript II, and so on. 
Furthermore, the protein of the present invention can be 
expressed as the secreted form or the form incorporated into 

20 the microsome membrane, when a canine pancreas microsome or 
the like is added to the reaction system. 

In the case where one of the protein of the present 
invention is produced by expressing the DNA in a 
microorganism such as Escherichia coli etc., a recombinant 

25 expression vector bearing the translated region of the cDNA 
of the present invention is constructed in an expression 
vector having an origin which can be replicated in the 
microorganism, a promoter, a ribosome-binding site, a cDNA- 
cloning s ite , a terminator etc . and , after transformation of 

:..raris loniian; ; ::cuDaiea , wnereoy z ne protein encoaea ny 

said cDNA can be produced on a large scale in the 
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microorganism. In this case, a protein fragment containing 
any region can be obtained by carrying out the expression 
with inserting an initiation codon and a termination codon 
in front of and behind the selected translated region. 
5 Alternatively, a fusion protein with another protein can be 
expressed. Only the portion of the protein encoded by this 
cDNA can be obtained by cleavage of this fusion protein with 
a suitable protease. The expression vector for Escherichia 
coli is exemplified by the pUC series, pBluescript II, the 

10 pET expression system, the pGEX expression system, and so on. 

In the case where one of the proteins of the present 
invention is produced by expressing the DNA in eucaryotic 
cells, the protein of the present invention can be produced 
as a secretory protein or as a membrane protein on the cell- 

15 membrane surface, when the translated region of this cDNA is 
introduced into an expression vector for eucaryotic cells 
that has a promoter, a splicing region, a poly(A) addition 
site, etc . , followed by introduction into the eucaryotic 
cells. The expression vector is exemplified by pKAl , 

20 pED6dpc2, pCDM8, pSVK3 , pMSG, pSVL, pBK-CMV, pBK-RSV, EBV 
vector, pRS , pYES2 , and so on. Examples of eucaryotic cells 
to be used in general include mammalian cultured cells such 
as simian kidney cells COS7, Chinese hamster ovary cells CHO, 
etc., budding yeasts, fission yeasts, silkworm cells, 

25 Xenopus oocytes, and so on, but any eucaryotic cells may be 
used, provided that they are capable of expressing the 
proteins of the present invention. The expression vector can 
be introduced into the eucaryotic cells by methods known in 
the art such as the electroporat ion method, the calcium 

me tnoa, ana s o o i : . 

After one of the proteins of the present invention is 
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expressed in prokaryotic cells or eucaryotic cells, the 
objective protein can be isolated from the culture and 
purified by a combination of separation procedures known in 
the art. Such examples include treatment with a denaturing 
5 agent such as urea or a detergent, sonication, enzymatic 
digestion, salting-out or solvent precipitation, dialysis , 
centrifugation, ultrafiltration, gel filtration, SDS-PAGE, 
isoelectric focus ing , ion-exchange chromatography , 
hydrophobic chromatography, affinity chromatography, reverse 

10 phase chromatography, and so on. 

The proteins of the present invention include peptide 
fragments (5 amino acid residues or more) containing any 
partial amino acid sequence in the amino acid sequences 
represented by SEQ ID Nos. 1. to 10, 31 to 40, 61 to 70, 91 

15 to 100 , and 121 to 130 . These peptide fragments can be 
utilized as antigens for preparation of antibodies. Hereupon, 
among the proteins of the present invention, those having 
the signal sequences are secreted in the form of mature 
proteins, after the signal sequences are removed. Therefore, 

20 these mature proteins shall come within the scope of the 
present invention. The N-terminal amino acid sequences of 
the mature proteins can be easily determined by using the 
method for the determination of cleavage site of a signal 
sequence [JP 8-187100 A]. Furthermore, some membrane 

25 proteins undergo the processing on the cell surface to be 
converted to the secretory forms. Such proteins or peptides 
in the secretory forms shall come within the scope of the 
present invention. In the case where sugar chain-binding 
sites ar* 3 present in thp amino acid seauenees , expression in 

::nams are aitacnea . Accordingly , such proteins or peptides 
to which sugar chains are attached shall come within the 
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scope of the present invention. 

The DNAs of the present invention include all the DNAs 
coding for the above-mentioned proteins . These DNAs can be 
obtained by using a method by chemical synthesis, a method 
5 by cDNA cloning, and so on. 

The cDNAs of the present invention can be cloned, for 
example, from cDNA libraries derived from the human cells • 
These cDNAs are synthesized by using as templates poly(A)* 
RNAs extracted from human cells . The human cells may be 

10 cells delivered from the human body, for example, by the 
operation or may be the cultured ceils . The cDNAs can be 
synthesized by using any method selected from the Okayama- 
Berg method [Okayama, H. and Berg, P., Mol. Cell. Biol. 2: 
161-170 (1982)], the Gubler-Hof fman method [Gubler, U. and 

15 Hoffman, J. Gene 25: 263-269 (1983)], and so on, but it is 
preferred to use the capping method [Kato, S. et al., Gene 
150: 243-250 (1994)], as exemplified in Examples, in order 
to obtain a full-length clone in an effective manner. In 
addition, commercially available, human cDNA libraries can 

20 be utilized. Cloning of the cDNAs of the present invention 
from the cDNA libraries can be carried out by synthesis of 
an oligonucleotide on the basis of base sequences of any 
portion in the cDNA of the present invention, followed by 
screening using this oligonucleotide as the probe according 

25 to the colony or plaque hybridization by a method known in 
the art . In addition , the cDNA fragments of the present 
invention can be prepared by synthesis of oligonucleotides 
which hybridize with both termini of the objective cDNA 
fragment, followed by the usage of these oligonucleotides as 

ron; numaii cfuib . 



The cDNAs of the present invention are cliara'.t^-.ized by 
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comprising either of the base sequences represented by SEQ 
ID Nos. 11 to 20, 41 to 50, 71 to 80, 101 to 110, and 131 to 
14 0 or the base sequences represented by SEQ ID Nos . 21 to 
30, 51 to 60, 81 to 90, 111 to 120, and 141 to 150. Table 1 
5 summarizes the clone number ( HP number ) , the cells from 
which the cDNA was obtained, the total base number of the 
cDNA, and the number of the amino acid residues of the 
encoded protein, for each of the cDNAs. 
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Table 1 





ID No. 


HP 
number 


Cells 


Base 
number 


Number 
of amino 

acid 
residues 


1, 


11, 


21 


HP01550 


Stomach 


cancer 


510 


125 


2, 


12, 


22 


HP02593 


Saos-2 




697 


131 


3, 


13, 


23 


HP10195 


HT-1080 




1619 


242 


4, 


14, 


24 


HP10423 


U-2 OS 




1066 


264 


5, 


15, 


25 


HP10506 


Stomach 


cancer 


618 


112 


6, 


16, 


26 


HP10507 


Stomach 


cancer 


1021 


146 


7, 


17, 


27 


HP10548 


Stomach 


cancer 


1432 


344 


8, 


18, 


28 


HP10566 


Stomach 


cancer 


601 


97 


9, 


19, 


29 


HP10567 


Stomach 


cancer 


585 


124 


10, 


20, 


30 


HP10569 


Stomach 


cancer 


1100 


327 


31, 


41, 


51 


HP01426 


Stomach 


cancer 


1065 


313 


32, 


42, 


52 


HP02515 


Saos-2 




937 


229 


33, 


43, 


53 


HP02575 


Saos-2 




1678 


467 


34, 


44, 


54 


HP10357 


Stomach 


cancer 


467 


99 


35, 


45, 


55 


HP10447 


Liver 




875 


189 


36, 


46, 


56 


HP10477 


Liver 




1256 


363 


37, 


47, 


57 


HP10513 


Stomach 


cancer 


884 


249 


38, 


48, 


58 


HP10540 


Saos-2 




589 


98 


39, 


49, 


59 


HP10557 


Stomach 


cancer 


673 


172 


40 


50, 


60 


HP10563 


Saos-2 




1425 


120 


61, 


71, 


81 


HP01467 


HT-1080 




1436 


307 


62, 


72, 


82 


HP01956 


Liver 




997 


183 


63, 


73, 


83 


HP02545 


Saos-2 




1753 


327 


64, 


74, 


84 


HP02551 


Saos-2 




1117 


223 


65, 


75, 


85 


HP02631 


Saos-2 




1380 


48 


66, 


76, 


86 


HP02632 


HT-1080 




1503 


371 








::r 1 n A P Q 






' 73? 


90 








H ? 1 C b 4 




-. : a n c ♦.„• 






i 70, 


80, 


90 


; HP10571 


1 Stomach 


cancer 


1229 


i 152 
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91/ 


10 1 , 


111 


HP 0 1 4 7 0 


Stomach 


cancer 


lo 19 


358 


92, 


102 , 


112 


HP02419 


Stomach 


cancer 


2054 


226 


"93, 


103 , 


113 


HP02631 


Saos-2 




1380 


195 


94, 


104 , 


114 


HP02695 


Stomach 


cancer 


1292 


339 


95, 


105 , 


115 


HP 10 031 


Saos-2 




2168 


487 


96, 


106 , 


116 


HP10530 


Saos-2 




1357 


393 


97, 


107, 


117 


HP10541 


Stomach 


cancer 


711 


196 


98, 


108 , 


118 


HP10550 


Stomach 


cancer 


651 


107 


99, 


109, 


119 


HP10590 


HT-1080 




1310 


350 


100, 


110 , 


120 


HP10591 


HT-1080 




1400 


107 


121, 


131, 


141 


HP01462 


HT-1080 




2050 


483 


122, 


132 , 


142 


HP02485 


Stomach 


cancer 


2746 


334 


123, 


133, 


143 


HP02798 


HT-1080 




1136 


267 


124, 


134 , 


144 


HP10041 


Saos-2 




619 


106 


125, 


135, 


145 


HP10246 


KB 




864 


224 


126, 


136, 


146 


HP10392 


U-2 OS 




1527 


258 


127, 


137, 


147 


HP10489 


Stomach 


cancer 


659 


110 


128, 


138, 


148 


HP10519 


Stomach 


cancer 


710 


91 


129, 


139, 


149 


HP10531 


Saos-2 




2182 


344 


130, 


140 , 


150 


HP10574 


Stomach 


cancer 


2773 


428 



Hereupon , the same clones as the cDNAs of the present 
invention can be easily obtained by screening of the cDNA 
libraries constructed from the human cell lines or human 
5 tissues utilized in the present invention by the use of an 
oligonucleotide probe synthesized on the basis of the cDNA 
base sequence described in any of SEQ ID Nos . 11 to 30, 41 
to 60, 71 to 90, 101 to 120, and 131 to 150. 

In general, the polymorphism due to the individual 
10 difference is frequently observed in human genes. 



SEQ ID Nos - II to 30, I ^ to bG, 7 1 zo 90, 



1 2 C , ana 
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131 to 150 shall come within the scope of the present 
invention . 

In a similar manner, any protein in which one or plural 
amino acids are inserted, deleted and/or substituted with 
5 other amino acids shall come within the scope of the present 
invention , as far as the protein possesses the activity of 
any protein having the amino acid sequences represented by 
SEQ ID Nos. 1 to 10, 31 to 40, 61 to 70, 91 to 100, and 121 
to 130. 

10 The cDNAs of the present invention include cDNA 

fragments (10 bp or more ) containing any partial base 
sequence in the base sequences represented by SEQ ID Nos* 11 
to 20, 41 to 50, 71 to 80, 101 to 110, and 131 to 140 or in 
the base sequences represented by SEQ ID Nos. 21 to 30, 51 

15 to 60, 81 to 90, 111 to 120, and 141 to 150. Also, DNA 
fragments consisting of a sense strand and an anti-sense 
strand shall come within this scope. These DNA fragments can 
be utilized as the probes for the genetic diagnosis. 

In addition to the activities and uses described above, 

20 the polynucleotides and proteins of the present invention 
may exhibit one or more of the uses or biological activities 
(including those associated with assays cited herein) 
identified below. Uses or activities described for proteins 
of the present invention may be provided by administration 

25 or use of such proteins or by administration or use of 
polynucleotides encoding such proteins (such as, for example, 
in gene therapies or vectors suitable for introduction of 
DNA) . 

various purposes . 
express recombinant 



car: usea by * researcr: community 

The polynucleotides can be used to 
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protein for analysis , characterization or therapeutic use; 
as markers for tissues in which the corresponding protein is 
preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or 
5 in disease states); as molecular weight markers on Southern 
gels; as chromosome markers or tags (when labeled) to 
identify chromosomes or to map related gene positions; to 
compare with endogenous DNA sequences in patients to 
identify potential genetic disorders ; as probes to hybridize 

10 and thus discover novel, related DNA sequences; as a source 
of information to derive PCR primers for genetic 
fingerprinting; as a probe to "subtract-out" known sequences 
in the process of discovering other novel polynucleotides ; 
for selecting and making oligomers for attachment to a "gene 

15 chip" or other support, including for examination of 
expression patterns; to raise anti-protein antibodiesusing 
DNA immunization techniques; and as an antigen to raise 
anti-DNA antibodies or elicit another immune response. 
Where the polynucleotide encodes a protein which binds or 

20 potentially binds to another protein ( such as , for example , 
in a receptor-ligand interaction), the polynucleotide can 
also be used in interaction trap assays (such as, for 
example, that described in Gyuris et al., Cell 75:791-803 
(1993)) to identify polynucleotides encoding the other 

25 protein with which binding occurs or to identify inhibitors 
of the binding interaction. 

The proteins provided by the present invention can 
similarly be used in assay to determine biological activity, 
including in a panel of multiple proteins for high- 



anotne: iTJiiune response; at? a reageni . ^nciuainq me iadeiea 
reagent) in assays designed to quantitatively determine 
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levels of the protein (or its receptor) in biological 
fluids; as markers for tissues in which the corresponding 
protein is preferentially expressed (either constitutively 
or at a particular stage of tissue differentiation or 
5 development or in a disease state); and, of course, to 
isolate correlative receptors or ligands . Where the protein 
binds or potentially binds to another protein (such as, for 
example, in a receptor-ligand interaction), the protein can 
be used to identify the other protein with which binding 
10 occurs or to identify inhibitors of the binding interaction. 
Proteins involved in these binding interactions can also be 
used to screen for peptide or small molecule inhibitors or 
agonists of the binding interaction. 

Any or all of these research utilities are capable of 
15 being developed into reagent grade or kit format for 
commercialization as research products. 

Methods for performing the uses listed above are well 
known to those skilled in the art. References disclosing 
such methods include without limitation "Molecular Cloning: 
20 A Laboratory Manual", 2d ed. , Cold Spring Harbor Laboratory 
Press, Sambrook, J., E.F. Fritsch and T. Maniatis eds., 1989, 
and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S.L. and A.R. Kimmel 
eds., 1987. 
25 Nutritional Uses 

Polynucleotides and proteins of the present invention 
can also be used as nutritional sources or supplements. 
Such uses include without limitation use as a protein or 

such cases the protein ox polynucleotide oi the invention 
can be added to the feed of a particular organism or can be 
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administered as a separate solid or liquid preparation, such 
as in the form of powder, pills, solutions, suspensions or 
capsules . In the case of microorganisms , the protein or 
polynucleotide of the invention can be added to the medium 
5 in or on which the microorganism is cultured. 

cytokine and Ce ll Proliferation/Differentiation 
Activity 

A protein of the present invention may exhibit cytokine, 
cell proliferation (either inducing or inhibiting) or cell 

10 differentiation (either inducing or inhibiting) activity or 
may induce production of other cytokines in certain cell 
populations. Many protein factors discovered to date, 
including all known cytokines, have exhibited activity in 
one or more factor dependent cell proliferation assays, and 

15 hence the assays serve as a convenient confirmation of 
cytokine activity. The activity of a protein of the present 
invention is evidenced by any one of a number of routine 
factor dependent cell proliferation assays for cell lines 
including, without limitation, 3 2D, DA2 , DA1G, T10, B9, 

20 B9/11, BaF3 , MC9/G, M+ ( preB M+) , 2E8, RB5, DAI, 123, T1165, 
HT2 , CTLL2, TF-1, Mo7e and CMK. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Assays for T-cell or thymocyte proliferation include 

25 without limitation those described in: Current Protocols in 
Immunology, Ed by J. E . Coligan, A.M. Kruisbeek, D.H. 
Margiilies, E.M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro 

1 37:3494-3500, 1986; Bertagnoil i el. al . , ,■ , Immunol . 
145: 1706-1712 , 1990 ; Bertagnolli et al . , Cellular 
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Immunology 133:327-341, 1991; Bertagnolli, et al., J. 

Immunol. 149:3778-3783, 1992; Bowman et al., J. Immunol, 

152: 1756-1761, 1994. 

Assays for cytokine production and/or proliferation of 
5 spleen cells, lymph node cells or thymocytes include, 

without limitation/ those described in: Polyclonal T cell 

stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current 

Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 

3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 
10 Measurement of mouse and human Interferon y, Schreiber, R.D. 

In Current Protocols in Immunology. J.E.e.a, Coligan eds. 

Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of 

hematopoietic and lymphopoietic cells include, without 
15 limitation, those described in: Measurement of Human and 

Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, 

L.S. and Lipsky, P.E. In Current Protocols in Immunology. 

J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and 

Sons, Toronto. 1991; deVries et al., J. Exp, Med. 173:1205- 
20 1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931- 

2938, 1983; Measurement of mouse and human interleukin 6- 

Nordan, R. In Current Protocols in Immunology. J.E.e.a. 

Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, 
25 Toronto. 1991; Smith et al . , Proc. Natl. Acad. Sci. U.S.A. 

83:1857-1861, 1986; Measurement of human Interleukin 11 - 

Bennett, F. , Giannotti, J. , Clark, S.C. and Turner, K. J . 

In Current Protocols in Immunology. J.E.e.a. Coligan eds. 

vni 1 np r i ^ -i John Wilev and Sons, Toronto. 1991; 

Giannotti., . , Clar^S.C. ana Turner, V .u . In ■; :urrent. 

Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 
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6.13,1, John Wiley and Sons, Toronto. 1991. 

Assays for T-cell clone responses to antigens (which 
will identify, among others , proteins that affect APC-T cell 
interactions as well as direct T-cell effects by measuring 
5 proliferation and cytokine production) include, without 
limitation, those described in: Current Protocols in 
Immunology , Ed by J . E . Coligan , A.M. Kruisbeek , D . H . 
Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing 
Associates and Wiley-Interscience (Chapter 3, In Vitro 

10 assays for Mouse Lymphocyte Function; Chapter 6, Cytokines 
and their cellular receptors; Chapter 7, Immunologic studies 
in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 
77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 

15 . 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

Tmmnnp. stimula ting or Suppressi ng Act i v i ty 
A protein of the present invention may also exhibit 
immune stimulating or immune suppressing activity, including 
without limitation the activities for which assays are 

20 described herein. A protein may be useful in the treatment 
of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in 
regulating (up or down) growth and proliferation of T and/or 
B lymphocytes, as well as effecting the cytolytic activity 

25 of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial orfungal infections, or may result 
from autoijnmune disorders. More specifically, infectious 

.invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmania spp., malaria spp. 
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and various fungal infections such as candidiasis. Of 
course, in this regard, a protein of the present invention 
may also be useful where a boost to the immune system 
generally may be desirable, i.e., in the treatment of cancer. 
5 Autoimmune disorders which may be treated using a 

protein of the present invention include, for example, 
connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune 
pulmonary inflammation, Guillain-Barre syndrome, autoimmune 
10 thyroiditis, insulin dependent diabetes mellitis, myasthenia 
gravis, graf t-versus-host disease and autoimmune 
inflammatory eye disease. Such a protein of the present 
invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly 
15 allergic asthma) or other respiratory problems. Other 
conditions, in which immune suppression is desired 
(including, for example, organ transplantation), may also be 
treatable using a protein of the present invention. 

Using the proteins of the invention it may also be 
20 possible to immune responses, in a number of ways. Down 
regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve 
preventing the induction of an immune response. The 
functions of activated T cells may be inhibited by 
25 suppressing T cell responses or by inducing specific 
tolerance in T cells, or both. Immunosuppression of T cell 
responses is generally an active, non-antigen-specific, 
process which requires continuous exposure of the T cells to 
+. ^ , ,^. r , T - 00 0 4 , n ^pnf Tn] pr-^nnp whi rh "involves indue ina 

from immunosuppression m thai : ; is generally antigen- 
specific and persists after exposure to the tolerizing agent 
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has ceased. Operationally, tolerance can be demonstrated by 
the lack of a T cell response upon reexposure to specific 
antigen in the absence of the tolerizing agent. 

Down regulating or preventing one or more antigen 
5 functions (including without limitation B lymphocyte antigen 
functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will 
be useful in situations of tissue, skin and organ 
transplantation and in graf t-versus-host disease (GVHD). 
10 For example, blockage of T cell function should result in 
reduced tissue destruction in tissue transplantation. 
Typically, in tissue transplants, rejection of the 
transplant is initiated through its recognition as foreign 
by T cells, followed by an ijnmune reaction that destroys the 
15 transplant. The administration of a molecule which inhibits 
or blocks interaction of a B7 lymphocyte antigen with its 
natural ligand(s) on immune cells (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in 
conjunction with a monomeric form of a peptide having an 
20 activity of another B lymphocyte antigen (e.g., B7-1, B7-3) 
or blocking antibody), prior to transplantation can lead to 
the binding of the molecule to the natural ligand(s) on the 
immune cells without transmitting the corresponding 
costimulatory signal. Blocking B lymphocyte antigen 

25 function in this matter prevents cytokine synthesis by 
immune cells, such as T cells, and thus acts as an 
immunosuppressant. Moreover, the lack of costimulation may 
also be sufficient to anergize the T cells, thereby inducing 
^ninran^n -> c-^-^^ v Tnrjij^f i or of 1 o n ri — t e TTTi tolerance by 

necessity ci repeated administration c: these blocranq 
reagents- To achieve sufficient immunosuppression or 
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tolerance in a subject, it may also be necessary to block 
the function of a combination of B lymphocyte antigens. 

The efficacy of particular blocking reagents in 
preventing organ transplant rejection or GVHD can be 
5 assessed using animal models that are predictive of efficacy 
in humans . Examples of appropriate systems which can be 
used include allogeneic cardiac grafts in rats and 
xenogeneic pancreatic islet cell grafts in mice, both of 
which have been used to examine the immunosuppressive 

10 effects of CTLA4Ig fusion proteins in vivo as described in 
Lenschow et al., Science 257 :789-792 (1992) and Turka et al. f 
Proc. Natl. Acad. Sci USA, 89:11102-11105 (1992). In 
addition, murine models of GVHD (see Paul ed. , Fundamental 
immunology, Raven Press, New York, 1989, pp. 846-847) can be 

15 used to determine the effect of blocking B lymphocyte 
antigen function in vivo on the development of that disease. 

Blocking antigen function may also be therapeutically 
useful for treating autoimmune diseases. Many autoimmune 
disorders are the result of inappropriate activation of T 

20 cells that are reactive against self tissue and which 
promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate 
disease symptoms. Administration of reagents which block 

25 costimulation of T cells by disrupting receptor : ligand 
interactions of B lymphocyte antigens can be used to inhibit 
T cell activation and prevent production of autoantibodies 
or T cell-derived cytokines which may be involved in the 

3 : r . .. ... ^ ,-, ^ ^ i- ^! ; + — ' - . r , ~s l f - - Kl n<-V^ yp^nn^^c TT\c5 V "in dll CP 

could .eac ic ,.ong-teni; relief iron: the aisease. The 
efficacy of blocking reagents in preventing or alleviating 
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autoimmune disorders can be determined using a number of 
well-characterized animal models of human autoimmune 
diseases. Examples include murine experimental autoimmune 
encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr 
5 mice or NZB hybrid mice, murine autoimmune collagen 
arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed. , 
Fundamental Immunology, Raven Press, New York, 1989, pp. 
840-856) . 

10 Upregulation of an antigen function (preferably a B 

lymphocyte antigen function), as a means of up regulating 
immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of 
enhancing an existing immune response or eliciting an 

15 initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function 
may be us ef ul in cas es of viral infection . In addition , 
systemic viral diseases such as influenza, the commoncold, 
and encephalitis might be alleviated by the administration 

20 of stimulatory forms of B lymphocyte antigens systemically . 

Alternatively, anti-viral immune responses may be 
enhanced in an infected patient by removing T cells from the 
patient, cost emulating the T cells in vitro with viral 
antigen-pulsed APCs either expressing a peptide of the 

25 present invention or together with a stimulatory form of a 
soluble peptide of the present invention and reintroducing 
the in vitro activated T cells into the patient. Another 
method of enhancing anti-viral immune responses would be to 
: ,~ - 1 n r ' > * r-, - + ? - o 1 1 ~ * — ~ - n^ipr f +■ ""~ a n p f er t- 1- h ^vr with a 

described herein sucn mat the r'elis express all or a 
portion of the protein on their surface, and reintroduce the 
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transfected cells into the patient. The infected cells 
would now be capable of delivering a costimulatory signal to, 
and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of 
5 antigen function (preferably B lymphocyte antigen function) 
may be useful in the induction of tumor immunity. Tumor 
cells (e.g., sarcoma, melanoma, lymphoma, leukemia, 
neuroblastoma, carcinoma) transfected with a nucleic acid 
encoding at least one peptide of the present invention can 

10 be administered to a subject to overcome tumor-specific 
tolerance in the subject. If desired, the tumor cell can be 
transfected to express a combination of peptides. For 
example, tumor cells obtained from a patient can be 
transfected ex vivo with an expression vector directing the 

15 expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity 
and/or B7-3-like activity. The transfected tumor cells are 
returned to the patient to result in expression of the 
peptides on the surface of the transfected cell. 

20 Alternatively, gene therapy techniques can be used to target 
a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention 
having the activity of a B lymphocyte antigen(s) on the 
surface of the tumor cell provides the necessary 

25 costimulation signal to T cells to induce a T cell mediated 
immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II 
molecules, or which fail to reexpress sufficient amounts of 
wtr" - • . w ---> t t rrol pru 1 e? , fan be transfected 

cy-copiasmic-ciomain truncated portion) ol an MHC :>ass 
chain protein and 2 microglobulin protein or an MHC class 
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II chain protein and an MHC class II chain protein to 
thereby express MHC class I or MHC class II proteins on the 
cell surface. Expression of the appropriate class I or 
class II MHC in conjunction with a peptide having the 
5 activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) 
induces a T cell mediated immune response against the 
trans fected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class 
II associated protein, such as the invariant chain, can also 

10 be cotrans fected with a DNA encoding a peptide having the 
activity of a B lymphocyte antigen to promote presentation 
of tumor associated antigens and induce tumor specific 
immunity. Thus , the induction of a T cell mediated immune 
response in a human subject may be sufficient to overcome 

15 tumor-specific tolerance in the subject. 

The activity of a protein of the invention may, among 
other means, be measured by the following methods: 

Suitable assays for thymocyte or splenocyte 
cytotoxicity include, without limitation, those described 

20 in: Current Protocols in Immunology, Ed by J. E. Coligan, 
A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. 
Greene Publishing Associates and Wiley-Interscience (Chapter 
3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., 

25 Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981; Herrmann et 
al. , J. Immunol. 128 : 196 8-1974, 1982 ; Handa et al. , J. 
Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al . , J. Immunol. 140:508-512, 
1988; Herrmann et al . , Proc, Natl. Acad. Sci. USA 78 : 2488- 



. 9 8 2. ; Handa el :^ . , Z. Immunox . 1 3 b : I b 64 - 1 5 7 2 , 198b; Takai. 
et al., J. Immunol. 137 : 3494-3500, 1986 ; Bowmanet al., J. 
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Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 
1988; Bertagnolli et al . , Cellular Immunology 133:327-341, 
1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 

Assays for T-cell-dependent immunoglobulin responses 
and isotype switching (which will identify, among others, 
proteins that modulate T-cell dependent antibody responses 
and that affect Thl/Th2 profiles) include, without 
limitation, those described in: Maliszewski, J. Immunol. 
144:3028-3033, 1990; and Assays for B cell function: In 
vitro antibody production, Mond, J.J. and Brunswick, M. In 
Current Protocols in Immunology. J.E.e.a. Coligan eds . Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will 
identify, among others, proteins that generate predominantly 
Thl and CTL responses) include, without limitation, those 
described in: Current Protocols in Immunology, Ed by J. E. 
Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W 
Strober, Pub. Greene Publishing Associates and Wiley- 
interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies 
in Humans); Takai et al. , J. Immunol. 137:3494-3500, 1986; 
Takai et al. , J. Immunol. 140:508-512, 1988; Bertagnolli et 
al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, 
25 among others, proteins expressed by dendritic cells that 
activate naive T-cells ) include, without limitation, those 
described in: Guery et al., J. Immunol. 134:536-544, 1995; 
inaba et al., Journal of Experimental Medicine 173:549-559, 

-! Trvirnal of Immunology 154:5071-5079, 



15 



20 



<->+• Ft ' 
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.82:255-260, 1995; Nai: <>x - lourna, .i . .rux^ 

7:4062-4069, 1993; Huang et al., Science 264:961-965, 
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1994; Macatonia et al., Journal of Experimental Medicine 
169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical 
Investigation 94:797-807, 1994; and Inaba et al., Journal 
of Experimental Medicine 172:631-640, 1990. 
5 Assays for lymphocyte survival/apoptosis (which will 

identify, among others, proteins that prevent apoptosis 
after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those 
described in: Darzynkiewicz et al., Cytometry 13:795-808, 

10 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et 
al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 
66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 
1993; Gorczyca et al . , International Journal of Oncology 

15 1:639-648, 1992. 

Assays for proteins that influence early steps of T- 
cell commitment and development include, without limitation, 
those described in: Antica et al., Blood 84:111-117, 1994; 
Fine et al., Cellular Immunology 155:111-122, 1994; Galy et 

20 al., Blood 85:2770-2778, 1995; Toki et al., Proc. Nat. Acad 
Sci. USA 88:7548-7551, 1991. 

Hematopoies i s Regulating Activity 

A protein of the present invention may be useful in 
regulation of hematopoiesis and, consequently, in the 
25 treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming 
cells or of factor-dependent cell lines indicates 
involvement in regulating hematopoiesis, e.g. in supporting 



.naicatmq utility, lor example, in treating various anemxas 
or for use in conjunction with irradiation/chemotherapy to 
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stimulate the production of erythroid precursors and/or 
erythroid cells; in supporting the growth and proliferation 
of myeloid cells such as granulocytes and 
monocytes /macrophages (i.e. , traditional CSF activity ) 
5 useful, for example, in conjunction with chemotherapy to 
prevent or treat consequent myelo-suppression; in supporting 
the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or 
treatment of various platelet disorders such as 

10 thrombocytopenia, and generally for use in place of or 
complimentary to platelet transfusions; and/or in supporting 
the growth and proliferation of hematopoietic stem cells 
which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic 

15 utility in various stem cell disorders (such as those 
usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal 
hemoglobinuria), as well as in repopulating the stem cell 
compartment post irradiation/chemotherapy, either in-vivo or 

20 ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell 
transplantation (homologous or heterologous) ) as normal 
cells or genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among 

25 other means, be measured by the following methods: 

Suitable assays for proliferation and differentiation 
of various hematopoietic lines are cited above. 

Assays for embryonic stem cell differentiation (which 

. iiy.: tatio:; , 'hose aescribea in: Johansson et ^ i . "ellular 
Biology 15:141-151, 1995; Keller et al., Molecular and 
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Cellular Biology 13:473-486, 1993; McClanahan et al.. Blood 

81:2903-2915, 1993. 

Assays for stem cell survival and differentiation 
(which will identify, among others, proteins that regulate 
5 lympho-hematopoiesis) include, without Imitation, those 
described in: Methylcellulose colony forming assays, 
Freshney, M.G. In Culture of Hematopoietic Cells. R.I. 
Freshney, et al. eds . Vol pp. 265-268, Wiley-Liss, Inc., 
New York, NY. 1994; Hirayama et al., Proc. Natl. Acad. sci. 
10 USA 89:5907-5911, 1992; Primitive hematopoietic colony 
forming cells with high proliferative potential, McNiece, 
I.K. and Briddell, R.A. in Culture of Hematopoietic Cells. 
R.I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., 
New York, NY. 1994; Neben et al., Experimental Hematology 
22:353-359, 1994; Cobblestone area forming cell assay, 
Ploemacher, R.E. In Culture of Hematopoietic Cells. R.I. 
Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New 
York, NY. 1994; Long term bone marrow cultures in the 
presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, 
et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, NY. 
1994; Long term culture initiating cell assay, Sutherland, 
H.J. in Culture of Hematopoietic Cells. R.I. Freshney, et al. 
eds. vol pp. 139-162, Wiley-Liss, Inc., New York, NY. 1994. 
25 Tissue c r™*^ Activity 

A protein of the present invention also may have 
utility in compositions used for bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well 

and Hssue repair and replacement, and 



15 



20 



A protein : the present invention, ,.nici. 
cartilage and/or bone growth in circumstances where bone is 
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not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and 
other animals . Such a preparation employing a protein of 
the invention may have prophylactic use in closed as well as 
5 open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an 
osteogenic agent contributes to the repair of congenital, 
trauma induced, or oncologic resection induced craniofacial 
defects , and also is useful in cosmetic plastic surgery . 

10 A protein of this invention may also be used in the 

treatment of periodontal disease, and in other tooth repair 
processes. Such agents may provide an environment to 
attract bone- forming cells , stimulate growth of bone- forming 
cells or induce differentiation of progenitors of bone- 

15 forming cells . A protein of the invention may also be 
useful in the treatment of osteoporosis or osteoarthritis, 
such as through stimulation of bone and/or cartilage repair 
or by blocking inflammation or processes of tissue 
destruction ( collagenase activity, osteoclast activity, 

20 etc.) mediated by inflammatory processes. 

Another category of tissue regeneration activity that 
may be attributable to the protein of the present invention 
is tendon/ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or 

25 other tissue formation in circumstances where such tissue is 
not normally formed, has application in the healing of 
tendon or ligament tears , deformities and other tendon or 
ligament defects in humans and other animals. Such a 

lenaon c;.. ^iqamei;: tissue, as wei A as use _ia the unproved 
fixation of tendon or ligament to bone or other tissues, and 
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in repairing defects to tendon or ligament tissue. De novo 
tendon/ligament-like tissue formation induced by a 
composition of the present invention contributes to the 
repair of congenital, trauma induced, or other tendon or 
5 ligament defects of other origin, and is also useful in 
cosmetic plastic surgery for attachment or repair of tendons 
or ligaments. The compositions of the present invention may 
provide an environment to attract tendon or ligament- forming 
cells, stimulate growth of tendon- or ligament-forming cells, 
10 induce differentiation of progenitors of tendon- or 
ligament- forming cells, or induce growth of tendon/ligament 
cells or progenitors ex vivo for return in vivo to effect 
tissue repair. The compositions of the invention may also 
be useful in the treatment of tendinitis, carpal tunnel 
15 syndrome and other tendon or ligament defects. The 
compositions may also include an appropriate matrix and/or 
sequestering agent as a carrier as is well known in the art. 

The protein of the present invention may also be useful 
for proliferation of neural cells and for regeneration of 
20 nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as 
well as mechanical and traumatic disorders, which involve 
degeneration, death or trauma to neural cells or nerve 
tissue. More specifically, a protein may be used in the 
25 treatment of diseases of the peripheral nervous system, such 
as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, 
such as Alzheimer's, Parkinson's disease, Huntington's 
-i ■• c-.-.-e-,-. •imv">-'-oDh^ ^t-pral prlerosis, and Shy-Drager 



accordance with the present invention include mechanical ana 
traumatic disorders, such as spinal cord disorders, head 
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trauma and cerebrovascular diseases such as stroke. 
Peripheral neuropathies resulting from chemotherapy or other 
medical therapies may also be treatable using a protein of 
the invention. 

5 Proteins of the invention may also be useful to promote 

better or faster closure of non-healing wounds, including 
without limitation pressure ulcers, ulcers associated with 
vascular insufficiency, surgical and traumatic wounds, and 
the like. 

10 It is expected that a protein of the present invention 

may also exhibit activity for generation or regeneration of 
other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), 
muscle (smooth, skeletal or cardiac) and vascular (including 

15 vascular endothelium) tissue, or for promoting the growth of 
cells comprising such tissues. Part of the desired effects 
may be by inhibition or modulation of fibrotic scarring to 
allow normal tissue to regenerate. A protein of the 
invention may also exhibit angiogenic activity. 

20 A protein of the present invention may also be useful 

for gut protection or regeneration and treatment of lung or 
liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful 

25 for promoting or inhibiting differentiation of tissues 
described above from precursor tissues or cells; or for 
inhibiting the growth of tissues described above. 

The activity of a protein of the invention may, among 

Ltiose described m : Internationa. latent 
No. WO95/16035 (bone, cartilage, tendon); 



limitation , 
Publication 
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International Patent Publication No. WO95/05846 (nerve, 
neuronal ) ; International Patent Publication No, WO91/07491 
( skin , endothelium ) . 

Assays for wound healing activity include, without 
5 limitation, those described in: Winter, Epidermal Wound 
Healing, pps . 71-112 (Maibach, HI and Rovee, DT, eds.), Year 
Book Medical Publishers , Inc . , Chicago , as modified by 
Eaglstein and Mertz, J. Invest. Dermatol 71:382-84 (1978). 
Activin/Inhibin Activity 

10 A protein of the present invention may also exhibit 

activin- or inhibin-related activities. Inhibins are 
characterized by their ability to inhibit the release of 
follicle stimulating hormone (FSH), while activins and are 
characterized by their ability to stimulate the release of 

15 follicle stimulating hormone (FSH). Thus, a protein of the 
present invention, alone or in heterodimers with a member of 
the inhibin family, may be useful as a contraceptive based 
on the ability of inhibins to decrease fertility in female 
mammals and decrease spermatogenesis in male mammals. 

20 Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the 
protein of the invention, as a homodimer or as a heterodimer 
with other protein subunits of the inhibin- group, may be 
useful as a fertility inducing therapeutic, based upon the 

25 ability of activin molecules in stimulating FSH release from 
cells of the anterior pituitary. See, for example, United 
States Patent 4,798,885. A protein of the invention may 
also be useful for advancement of the onset of fertility in 



sheep ana pigs . 

The activity of a protein of the invention may, among 
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other means, be measured by the following methods: 

Assays for activin/inhibin activity include, without 

limitation, those described in: Vale et al., Endocrinology 

91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; 
5 Vale et al,, Nature 321:776-779, 1986; Mason et al., Nature 

318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA 

83:3091-3095, 1986. 

chCTiotact 1c /Chemokinet i c Act ivitV 

A protein of the present invention may have chemotactic 
10 or chemokinetic activity (e.g., act as a chemokine) for 
mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, 
epithelial and/or endothelial cells. Chemotactic and 
chemokinetic proteins can be used to mobilize or attract a 
15 desired cell population to a desired site of action. 
Chemotactic or chemokinetic proteins provide particular 
advantages in treatment of wounds and other trauma to 
tissues, as well as in treatment of localized infections. 
For example, attraction of lymphocytes, monocytes or 
20 neutrophils to tumors or sites of infection may result in 
improved immune responses against the tumor or infecting 
agent . 

A protein or peptide has chemotactic activity for a 
particular cell population if it can stimulate, directly or 
25 indirectly, the directed orientation or movement of such 
cell population. Preferably, the protein or peptide has the 
ability to directly stimulate directed movement of cells. 
Whether a particular protein has chemotactic activity for a 

chemo taxis . 

The activity of a protein of the invention may, among 
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other means, be measured by the following methods: 

Assays for chemotactic activity (which will identify 
proteins that induce or prevent chemotaxis) consist of assays 
that measure the ability of a protein to induce the 
mig ration of cells across a membrane as well as the ability 
of a protein to induce the adhesion of one cell population 
to another cell population. Suitable assays for movement and 
adhesion include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J.E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. 
Greene Publishing Associates and Wiley-Interscience (Chapter 

4- r\-f ainha and beta Chemokines 6.12.1- 
6 12 Measurement of alpna ana 

S " 12.28, *aub * al. J. Clin. Invest. 95 = 1370-1376, 1995; 
L ind et al- APMIS 103:140-146, 1995; Muller et al Eur. J- 

Immunol. 25= 1744-1748; Gruber et al. a. of —1. 

152 = 5860-5867, 1994; Johnston et al. J. of Immunol. 153 = 

1762-1768, 1994. 

n- nr i ^ti,- »r.H Thrombolytic privity 

A protein of the invention may also exhibit hemostatic 
or thrombolytic activity. As a result,such a protein xs 
expected to be useful in treatment of various coagulation 
disorders , includinghereditary disorders, such as 
hemophilias) or to enhance coagulation and other hemostatic 
events in treating wounds resulting from trauma, surgery or 
other causes. A protein of the invention may also be useful 
for dissolving or inhibiting formation of thromboses and for 
treatment and prevention of conditions resulting therefrom 
(such as, for example, infarction of cardiac and central 

; . c . r i r- f n n e +" T~oke ) . 

-■n-v mP ans. be measured by the following methods : 

Assay for hemostatic and thrombolytic activity include, 
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without limitation, those described in: Linet et al., J. 
Clin. Pharmacol. 26: 131-140, 1986; Burdick et al. f 
Thrombos is Res . 45:413-419, 1987; Humphrey et al . , 
Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467- 
5 474, 1988. 

Receptor/Liganti Activity 

A protein of the present invention may also demonstrate 
activity as receptors , receptor ligands or inhibitors or 
agonists of receptor /I igand interactions . Examples of such 

10 receptors and ligands include, without limitation, cytokine 
receptors and their ligands , receptor kinases and their 
ligands , receptor phosphatases and their ligands , receptors 
involved in cell-cell interactions and their ligands 
( including without limitation, cellular adhesion molecules 

15 (such as selectins, integrins and their ligands) and 
receptor /ligand pairs involved in antigen presentation, 
antigen recognition and development of cellular and humoral 
immune responses ) . Receptors and ligands are also useful 
for screening of potential peptide or small molecule 

20 inhibitors of the relevant receptor/ligand interaction. A 
protein of the present invention ( including , without 
limitation, fragments of receptors and ligands) may 
themselves be useful as inhibitors of receptor/ligand 
interactions . 

25 The activity of a protein of the invention may, among 

other means, be measured by the following methods: 

Suitable assays for receptor-ligand activity include 
without limitation those described in : Current Protocols in 



Associates a;;a Wiley-Interscience v Chapter .26, Measurement 
of Cellular Adhesion under static conditions 7.28.1-7.28.22), 
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Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein 
et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et 
al., *J. Immunol. Methods 175:59-68, 1994; Stitt et al., 
5 Cell 80:661-670, 1995. 

Ant i -Inflammatory Activity 

Proteins of the present invention may also exhibit 
ant i- inflammatory activity. The anti-inflammatory activity 
may be achieved by providing a stimulus to cells involved in 

^ . • _n j - — i nhi^-i t i nrr nr nrnmnf inn r?f»1 1 — 

iU I-Iie -Lil-L -LdULilia L.UJ. _y -L CD^JUllo ^ f J^jr — - — « ^ IT J 

cell interactions (such as, for example, cell adhesion), by 
inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell 
extravasation, or by stimulating or suppressing production 

15 of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities 
can be used to treat inflammatory conditions including 
chronic or acute conditions), including without limitation 
inflammation associated with infection (such as septic shock, 

20 sepsis or systemic inflammatory response syndrome (SIRS)), 
ischemia-reperfusion injury, endotoxin lethality, arthritis, 
complement-mediated hyperacute rejection, nephritis , 
cytokine or chemokine-induced lung injury, inflammatory 
bowel disease, Crohn's disease or resulting from over 

25 production of ytokines such as TNF or IL-1. Proteins of the 
invention may also be useful to treat anaphylaxis and 
hypersensitivity to an antigenic substance or material. 

Tumor inhibition Activity 

30 i_mmu nologicai treatment or prevention ci tumors , o protein 
of the invention may exhibit other anti-tumor activities. A 
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protein may inhibit tumor growth directly or indirectly 
(such as, for example, via ADCC). A protein may exhibit its 
tumor inhibitory activity by acting on tumor tissue or tumor 
precursor tissue, by inhibiting formation of tissues 
necessary to support tumor growth (such as, for example, by 
inhibiting angiogenesis ) , by causing production of other 
factors, agents or cell types which inhibit tumor growth, or 
by suppressing, eliminating or inhibiting factors, agents or 
cell types which promote tumor growth 



A protein of the invention may also exhibit one or more 
of the following additional activities or effects: 
inhibiting the growth, infection or function of, or killing, 
infectious agents, including, without limitation, bacteria , 

15 viruses , fungi and other parasites ; effecting ( suppressing 
or enhancing ) bodily characteristics , including , without 
limitation, height, weight, hair color, eye color, skin, fat 
to lean ratio or other tissue pigmentation, or organ or body 
part size or shape ( such as , for example , breast 

20 augmentation or diminution, change in bone form or shape) ; 
effecting biorhythms or caricadic cycles or rhythms; 
effecting the fertility of male or female subjects ; 
effecting the metabolism, catabolism, anabolism, processing, 
utilization, storage or elimination of dietary fat , lipid, 

25 protein , carbohydrate , vitamins , minerals , cof actor s or 
other nutritional factors or component (s) ; effecting 
behavioral characteristics , including, without limitation, 
appet ite , 1 i bido , stress , roan i t ion M ncl ud i na roar i t i vp 

■ lij violent oenaviors; providing analgesic effects or other pain 
reducing effects ; promoting differentiation and growth of 
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embryonic stem cells in lineages other than hematopoietic 
lineages; hormonal or endocrine activity; in the case of 
enzymes , correcting deficiencies of the enzyme and treating 
deficiency-related diseases ; treatment of hyperprolif erative 
5 disorders (such as, for example, psoriasis); immunoglobulin- 
like activity ( such as , for example , the ability to bind 
antigens or complement); and the ability to act as an 
antigen in a vaccine composition to raise an immune response 
against such protein or another material or entity which is 
10 cross-reactive with such protein. 

Examples 

The present invention is specifically illustrated in 
more detail by the following Examples, but Examples are not 

15 intended to restrict the present invention. The basic 
operations with regard to the recombinant DNA and the 
enzymatic reactions were carried out according to the 
literature ["Molecular Cloning. A Laboratory Manual", Cold 
Spring Harbor Laboratory, 1989 ] . Unless otherwise stated, 

20 restrictive enzymes and a variety of modification enzymes to 
be used were those available from Takara Shuzo. The buffer 
compositions and the reaction conditions for each of the 
enzyme reactions were as described in the manufacturer ' s 
instructions . The cDNA synthesis was carried out according 

25 to the literature [Kato, S. et al. , Gene 150 : 243-250 
( 1994 ) ]. 

(1) Selection of cDNAs Encoding Proteins Having Hydrophobic 
Domains 

^aos-. ; W09 7 / 3 3 993 , , the cDNA library o: osteosarcoma cell 
line U-2 OS (W098/21328 ) , the cDNA library of epidermoid 
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carcinoma cell line KB (W098/11217 ) , the cDNA library of 
tissues of stomach cancer delivered by the operation 
(W098/21328) , the cDNA library of liver tissue delivered by 
the operation (W098/21328 ) , and were used for the cDNA 
5 libraries. Full-length cDNA clones were selected from 
respective libraries and the whole base sequences thereof 
were determined to construct a homo-protein cDNA bank 
consisting of the full-length cDNA clones. The 
hydrophobicity/hydrophilicity profiles were determined for 

10 the proteins encoded by the full-length cDNA clones 
registered in the homo-protein cDNA bank by the Kyte- 
Doolittle method [Kyte, J. & Doolittle, R. F., J. Mol. Biol. 
157: 105-132 (1982)] to examine the presence or absence of a 
hydrophobic region. Any clone that has a hydrophobic region 

15 being putative as a secretory signal or a transmembrane 
domain in the amino acid sequence of the encoded protein was 
selected as a clone candidate. 

(2) Protein Synthesis by In Vitro Translation 

The plasmid vector bearing the cDNA of the present 

20 invention was used for in vitro transcription/translation 
with a T N T rabbit reticulocyte lysate kit (Promega). In this 
case, [ 35 S]methionine was added to label the expression 
product with a radioisotope. Each of the reactions was 
carried out according to the protocols attached to the kit. 

25 Two micrograms of the plasmid was subjected to the reaction 
at 30 °C for 90 minutes in the reaction solution of a total 
volume of 25 ^1 containing 12.5 yl u of T N T rabbit 
reticulocyte lysate, 0.5 /j1 of a buffer solution (attached 



u.b u± oL TV RL : A polymerase, ana 2 0 U o: RNasm. Also, an 
experiment in the presence of a membrane system was carried 
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out by adding to this reaction system 2.5 jjI of a canine 
pancreas microsome fraction (Promega). To 3 yl of the 
resulting reaction solution was added 2 jjI of the SDS 
sampling buffer (125 mM Tr is -hydrochloric acid buffer, pH 
5 6.8, 120 mM 2-mercaptoethanol , 2% SDS solution, 0.025% 
bromophenol blue, and 2 0% glycerol) and the resulting 
mixture was heated at 95 °C for 3 minutes and then subjected 
to SDS-polyacrylamide gel electrophoresis. The molecular 
weight of the translation product was determined by carrying 

10 out the autoradiography. 
(3) Expression by C0S7 

Escherichia coli cells bearing the expression vector 
for the protein of the present invention was incubated at 
37 °C for 2 hours in 2 ml of the 2xYT culture medium 

15 containing 100 pg/ml of ampicillin, the helper phage M13K07 
(50 /i 1) was added, and the incubation was continued at 37°C 
overnight . A supernatant separated by centrif ugation 
underwent precipitation with polyethylene glycol to obtain 
single-stranded phage particles. These particles were 

20 suspended in 100 pi of 1 mM Tris-0.1 mM EDTA, pH 8 (TE ) . 

The cultured cells derived from simian kidney, COS7 , 
were incubated at 37 °C in the presence of 5% C0 2 in the 
Dulbecco's modified Eagle's culture medium (DMEM) containing 
10% fetal calf serum. Into a 6-well plate (Nunc, well 

25 diameter: 3 cm) were inoculated with 1 x 10 5 C0S7 cells and 
incubation was carried out at 37 °C for 22 hours in the 
presence of 5% C0 2 . After the culture medium was removed, 
the cell surface was washed with a phosphate buffer solution 
and then washed again with DMFM rnp^^'n^- m ^v* -> - 

was uaaea <: suspension o: u^.. ol the s mgie-st randed phage 
suspension, 0.6 ml of the DMEM culture medium, and 3 jjI of 
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TRANS FECTAM™ (IBF) and the resulting mixture was incubated 
at 37 °C for 3 hours in the presence of 5% C0 2 . After the 
sample solution was removed, the cell surface was washed 
with TDMEM, 2 ml per well of DMEM containing 10% fetal calf 
5 serum was added , and the incubation was carried out at 37 °C 
for 2 days in the presence of 5% C0 2 . After the culture 
medium was replaced by a culture medium containing 
[ 35 S ] cystine or [ 35 S ] methionine , the incubation was carried 
out for one hour. After the culture medium and the cells 
10 were separated by centrif ugation, proteins in the culture 
medium fraction and the cell-membrane fraction were 
subjected to SDS-PAGE. 
(4) Clone Examples 

<HP01550> (SEQ ID Nos . 1, 11, and 21) 

15 Determination of the whole base sequence of the cDNA 

insert of clone HP01550 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 65-bp 
5 '-untranslated region, a 378-bp ORF, and a 67-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

20 of 125 amino acid residues and there existed one putative 
transmembrane domain. Figure 1 depicts the 

hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation ucr.duct 

25 of 15 kDa that was almost identical with the mo 2 ocular 
weight of 13,825 predicted from the ORF. 

The search of the protein data base using tba amino 
acid sequence of the present protein revealed that the 
protein was similar to the CaenorhabH^^ ' -Tan 9 

"lame . snows tne comparison netween ammo aci-: ;o,:„ • *es c : 
the human protein of the present invention ( HP ) and the C . 
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elegans hypothetical protein F4 5G2 . c ( CE ) . Therein , the 
marks of *, and . represent a gap, an amino acid residue 
identical with that of the protein of the present invention/ 
and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 44,5% in the entire region. 

Table 2 



HP MAKYLAQIIVMGVQWGRAFARALRQEF AASRAAADARGRAGHRSAAASNLS- 

*..*..**.*.*. **..** *..*.** . 

CE MPWRTALKVALAAGEAVAKALTRA 

HP GLSLQEAQQILNV-SKLSPEEVQKNYEHLFKVNDKSVGGSFYLQSKVVRAKERLDEEL-K 
*.**.*. ***** . ***** # ***^ 

CE GISLEESLQILNVKTPLNREEVEKHYEHLFNINDKSKGGTLYLQSKVFRAKERIDEEFGR 
HP IQAQEDREKGQMPHT 

* • .*...*.. 
CE IELKEEKKKEENAKTE 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more ( for 
example, Accession No. AA338859) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP02593> (SEQ ID Nos. 2, 12, and 22) 

Determination of the whole base seauence of the cDNA 

osteosarcoma ::eii i xne .^aos-. reveaiea t tie structure 
consisting of a 103-bp 5 ' -untranslated region, a 396-bp ORF, 
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and a 198-bp 3 '-untranslated region. The ORF codes for a 
protein consisting of 131 amino acid residues and there 
existed four putative transmembrane domains at the C- 
terminus. Figure 2 depicts the hydrophobicity/hydrophilicity 
5 profile, obtained by the Kyte-Doolittle method, of the 
present protein. In vitro translation resulted in formation 
of a translation product of a high molecular weight. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 

10 protein was similar to a human OB-R gene-related protein 
(EMBL Accession No. Y12670) . Table 3 shows the comparison 
between amino acid sequences of the human protein of the 
present invention (HP) and the human OB-R gene-related 
protein ( OB ) . Therein , the marks of * , and . represent a 

15 gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention, 
respectively. The both proteins shared a homology of 67.9% 
in the entire region. 

20 

Table 3 



HP MAGIKALISLSFGGAIGLMFLMIjGCA^ 

*** *** ★ ** ***** ******** * ******* * **** ** * * * 

• • * • ■ • • • • m m m m 

25 OB MAGVKALVALSFSGAIGLTFL>ILGC^ 

HP AMSNACKELAIFLTTGIWSAFGLPIVFARAHLIEWGACALVLTGNTVIFATILGFFLVF 
* * ** *** * ********** * ** * **** *** ** *** ** **** * 

OB ATSSACRELAYFFTTGIVVSAFGFPVILARVAVIKWGACGLVLAGNAVIFLTIQGFFLIF 
HP GSNDDFSWQQW 
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Furthermore/ the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example , Accession No. AA306490) in ESTs, but, since they 
5 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10195> (SEQ ID Nos . 3, 13, and 23) 

10 Determination of the whole base sequence of the cDNA 

insert of clone HP10195 obtained from cDNA library of human 
fibrosarcoma HT-1080 revealed the structure consisting of a 
286-bp 5 ' -untranslated region, a 729-bp ORF, and a 604-bp 
3 '-untranslated region. The ORF codes for a protein 

15 consisting of 242 amino acid residues and there existed one 
putative transmembrane domain at the C-terminus. Figure 3 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 

20 product of 32 kDa that was somewhat larger than the 
molecular weight of 27,3 00 predicted from the ORF. When 
expressed in COS7 cells, an expression product of about 21 
kDa was observed in the supernatant fraction and the 
membrane fraction . 

25 The search of the protein data base using the amino 

acid sequence of the present protein has revealed the 
registration of sequences that were similar to the Aplysia 
VAP-33 (SWISS-PROT Accession No. P53173). Table 4 shows the 
comparison between amino acid sequences of the human protein 



; nerei:. , tine marKs or * f ana represent a gap, an ammo 

acid residue identical with that of the protein of the 
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5 



present invention, and an amino acid residue similar to that 
of the protein of the present invention , respectively . The 
both proteins shared a homology of 4 6.5% in the entire 
region . 

Table 4 



HP MAKHEQILVLDPPTDLKFKGPFTDWTTNLKLRNPSDRKVCFKVKTTAPRRYCVRPNSGI 

• * ^ * * * *.*.*. . .*.**********. # ***^**^ **. . ********* 4 ********** 

10 AP MASHEQALILEPAGELRFKGPFTDWTADLKLSNPTDRRICFKVKTTAPKRYCVRPNSGI 

ZJTi TT\Tir> C mi Trm T CI T\MT AT>T?ri Vr\TMTT?T/r T/TTT7T?»fT ;AmTt» W nrHTmrn \/rtri is TTT.TtfWH T,rT"ir^T-i-r « *~t-x ^ vr-o 

★ * ****** ****** ******* ** * ** * * *** ★* 

• • ••••• • • ••■ * • 

AP LEPKTSIAVAVMLQPFNYDPNEKNKHKFMVQSMYAPDHWESQELLWKDAPPESLMDTKL 

HP RCVFEMPNENDKLNDMEPSK AVPLNASKQDGPMPKP-HSVSLNDTE 

15 ******* *. * . ## **. b * 

AP RCVFEMPDGSHQAPASDASRATDAGAHFSESALEDPTVASRKTETQSPKRVGAVGSAGED 
HP TRKLMEECKRLQGEMMKLSEENRHLRDEGLRLRKVAHSD — KPGSTSTASFRDNVTSPLP 
** * * * * * ** * *** ****** * * * 

AP VKKLQHELKKAQSEITSLKGENSQLKDEGIRLRKVAMTDTVSPTPLNPSPAPAAAVRAFP 
20 HP SLLWIAAIFIGFFLGKFIL 

• • . *.***..*...***.* 
AP PWYWAAIILGLIIGKFLL 



25 Furthermore, the search of the GenBank using the base 

sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No, AA447905) in ESTs, but, since they 
are partial sequences , it can not be judged whether or not 

30 any of these sequences codes for the same protein as the 



<HP1042^> [SEQ ID Nos . 



1 4 ana 2 4 ) 
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Determination of the whole base sequence of the cDNA 
insert of clone HP10423 obtained from cDNA library of human 
osteosarcoma cell line U-2 OS revealed the structure 
consisting of a 64-bp 5 ' -untranslated region, a 795-bp ORF, 
5 and a 207-bp 3 ' -untranslated region- The ORF codes for a 
protein consisting of 264 amino acid residues and there 
existed a secretory signal at the N-terminus and one 
putative transmembrane domain at the N-terminus. Figure 4 
depicts the hydrophobicity/hydrophilicity profile, obtained 

10 by the Kyte-Doolittle method , of the present protein. In 
vitro translation resulted in formation of a translation 
product of 3 0 kDa that was almost identical with the 
molecular weight of 2 9,377 predicted from the ORF . When 
expressed in COS7 cells, an expression product of about 31 

15 kDa was observed in the membrane fraction. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 9 0% or more (for 
example, Accession No. D80116) in ESTs, but, since they are 

20 partial sequences , it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

<HP10506> (SEQ ID Nos. 5, 15, and 25) 
25 Determination of the whole base sequence of the cDNA 

insert of clone HP10506 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 53-bp 
5 '-untranslated region, a 339-bp ORF, and a 226-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

ransmemDr ant aomain. riqure jepicis :_ne 

hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
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Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 12 kDa that was almost identical with the molecular 
weight of 11,821 predicted from the ORF. When expressed in 
5 C0S7 cells, an expression product of about 13 kDa was 
observed in the membrane fraction. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more ( for 
10 example, Accession No. AA282544 ) in ESTs , but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

15 <HP10507> (SEQ ID Nos . 6, 16, and 26) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10507 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 412-bp 
5 ' -untranslated region, a 441-bp ORF, and a 168-bp 3'- 

20 untranslated region. The ORF codes for a protein consisting 
of 14 6 amino acid residues and there existed a secretory 
signal at the N-terminus and one putative transmembrane 
domain at the C-terminus . Figure 6 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

25 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 19 kDa that was somewhat larger than the molecular weight 
of 16,347 predicted from the ORF. 

Furth^nr.orp, the search of th*=> GenBank rsina the base 

or sequences t. r.a^ snarea a nomoxoqy oi 90% c: rr.ore : tor 
example, Accession No. AA424759 ) in ESTs , but, since they 
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are partial sequences , it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

5 <HP10548> (SEQ ID Nos . 7 , 17, and 27) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10548 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 330-bp 
5 '-untranslated region, a 1035-bp ORF, and a 67-bp 3'- 
10 untranslated region. The ORF codes for a protein consisting 
of 344 amino acid residues and there existed four putative 
transmembrane domains . Figure 7 depicts the 

hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
15 translation resulted in formation of a translation product 
of a high molecular weight. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more ( for 
20 example, Accession No. AA143152) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

25 <HP10566> (SEQ ID Nos. 8, 18, and 28) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10566 obtained from cDNA library of the 
human stomach cancer revealed the structure consisting of a 

61 -bp 5 '-untranslated reaion , a 294-bp ORF, and a ?46-bp 3'- 



> amine acia residues ana tnere existed one putative 
transmembrane domain at the C-terminus. Figure 8 depicts the 
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hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 12 kDa that was almost identical with the molecular 
5 weight of 11,452 predicted from the ORF . When expressed in 
C0S7 cells , an expression product of about 12 kDa was 
observed in the membrane fraction. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
10 of sequences that shared a homology of 90% or more ( for 
example, Accession No. W79821) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

15 

<HP10567> (SEQ ID Nos. 9, 19, and 29) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10567 obtained from cDNA library of the 
human stomach cancer revealed the structure consisting of a 

20 77-bp 5 '-untranslated region, a 375-bp ORF, and a 133-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 124 amino acid residues and there existed one putative 
transmembrane domain at the C-terminus. Figure 9 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

25 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 14 kDa that was almost identical with the molecular 
weight of 14,484 predicted from the ORF. 

o: sequences that, snared homology oi 90% cr "ore ( t or 
example, Accession No. AA428475) in ESTs, but, since they 
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are partial sequences , it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

5 <HP10568> (SEQ ID Nos. 10, 20, and 30) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10568 obtained from cDNA library of the 
human stomach cancer revealed the structure consisting of a 
56-bp 5 '-untranslated region, a 984-bp ORF, and a 60-bp 3'- 

10 untranslated region- The ORF codes for a protein consisting 
of 32 7 amino acid residues and there existed a secretory 
signal at the N-terminus and one putative transmembrane 
domain at the C-terminus . Figure 10 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

15 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 36.5 kDa that was almost identical with the molecular 
weight of 34, 326 predicted from the ORF. In this case, the 
addition of a microsome led to the formation of a product of 

20 40 kDa which is considered to have a sugar chain being 
attached. In addition, there exist in the amino acid 
sequence of this protein two sites at which N-glycosylation 
may occur ( Asn-Leu-Thr at position 138 and Asn-Leu-Ser at 
position 206). Application of the (-3,-1) rule, a method for 

25 predicting the cleavage site of the secretory signal 
sequence, allows to expect that the mature protein starts 
from valine at position 24. When expressed in COS7 cells, an 
expression product of about 31 kDa was observed in the 
supernatant fraction and the membrane fraction. 

icici sequence o: trie 
protein was similar 



present., proten; nas reveaiej tnat t.he 
to the human cell-surface A33 antigen 
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(SWISS-PROT Accession No. Q99795). Table 5 ^Lcv/s the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and the human cell-surface A33 
antigen (A3). Therein, the marks of *, and . represent a 
5 gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention , 
respectively. The both proteins shared a homology of 30.0% 
in the N-terminal region of 243 residues. 

10 

Table 5 



HP MAELPGPFLCGALLGFLCLSGLAVEVKVPTEPLSTPLGKTAELTCTYSTSVGDSFAL-EW 

*..*..* - * **...*.*** ** * .* 

15 A3 MVGKMWPVLWTLCAVRVTVDAISVETPQDVIJlASQGKSVTLPCTyHTSTSSREGLIQW 

HP SFVQPGKPISESHPILYFTNGHLYPTGSKSKRVSLLQNPPTVGVATLKLTDVHPSDTGTY 

. . . .*. * *.* .**.*. . * . * *.*** 

A3 DKLL — LTHTERWIWPFSNKN-YIHGELYKNRVSISNNAEQSDASITIDQLTMADNGTY 
HP LCQV1TOPPDFYTNGLGLINLTVLVPPSNPLCSQSGQTSVGGS xALRCSSSEGAPKPVYNV^ 

20 * * . .*» . ******.* * . . * . * * * 

A3 ECSVSLMSDLEGNTKSRVRLLVLVPPSKPECGIEGETIIGNNIQLTCQSKEGSPTPQYSW 
HP VRLGTFPTPSPGSMVQDEVSGQLILTNLSLTSSGTYRCVATI/QMGSASCELTLSVTEP.^-*- 

* ... * ..* . . *.*.* ..** * *...*. *.. *..*..* .** 

A3 KRYNILNQEQP--LAQPASGQPVSLKNISTDTSGYYICT5SNEFGTprC"NITVAVj' 

25 HP -QGRVAGALIGVLLGVLLLSVAAFCLVRFQKERGKKPKETYGGSDLPEDA fAPG'; PC 

. * ** * 
* * • •••••*•• • 

A3 NVALYVGIAVGWAALIIIGIIIYCCCCRGKDDNTEDKEDA}^PNREAYEf:PPl£pl WS'^U 

HP MRADSSKGFLERPSSASTVTTTKSKLPMW 

30 A3 EREEEDDYRQEEQRSTGRESPDHLDQ 



F u rtner^o r o , I he sear c r i : \ : \ *j ■. ■ ■ : \v . ■ . ■ 
sequences of the present cDNA hai' reveoif^ ~\,c r ^ .. . . ic-t 
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of sequences that shared a homology of 90% or more (for 
example, Accession No. T24595) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
5 of the present invention. 

<HP01426> (SEQ ID Nos . 31, 41, and 51) 

Determination of the whole base sequence of the cDNA 
insert of clone HP01426 obtained from cDNA library of human 

10 stomach cancer revealed the structure consisting of a 1-bp 
5 '-untranslated region, a 942-bp ORF, and a 122-bp 3 
untranslated region. The ORF codes for a protein consisting 
of 313 amino acid residues and there existed a putative 
secretory signal . Figure 1 1 depicts the 

15 hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein* In vitro 
translation resulted in formation of a translation product 
of 36 kDa that was almost identical with the molecular 
weight of 34,955 predicted from the ORF. In this case, the 

20 addition of a microsome led to the formation of a product of 
3 8 kDa which is considered to have a sugar chain being 
attached after secretion. In addition, there exists in the 
amino acid sequence of this protein one site at which N- 
glycosylation may occur (Asn-Ser-Ser at position 163). 

25 Application of the (-3,-1) rule, a method for predicting the 
cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from tryptophan at 
position 17 . When expressed in COS7 cells, an expression 
- >~~-^ ~- ~ -r ^ 0 . , +. a vn^ was nbsorvf>d i n t ho siiDernatant 

T h e s e a r c n c :. t he Drotem data n ase using t he a mm o 
acid sequence of the present protein revealed that the 
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protein was similar to the Xenopus laevis cortical granule 
lectin (EMBL Accession No. X82626). Table 6 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and the X. laevis cortical 
5 granule lectin ( XL ) . Therein , the marks of - , * , and . 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention , and an amino acid 
residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
10 of 67.9% in the region other than the N-terminal region. 



Table 6 



HP MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRT 
15 * ** * ★****★**. .* **.* * # 

XL MLVHILLLLVTGGLSQSCEPWIVASKNMVKQLDCDKFRSCKEIKDSNEEAQDGIYTLTS 
HP ENGVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANY 
# . * . ********. .*************.* ****.*********..************* 
XL SDGISYQTFCDMTTNGGGWTLVASVHENNMAGKCTIGDRWSSQQGNRADYPEGDGNWANY 
20 HP NTFGSAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLG 

****** m m ************** m * . ** . ******.* . ****** ****..*.* . * 
XL NTFGSAGGATSDDYKNPGYYDIEAYNLGVWHVPNKTPLSVWRNSSLQRYRTTDGILFKHG 
HP HNLFGIYQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRV 
***..*. ****** *.* .*.**..*****.*.*. ***.*** ...**.*..*** 
25 XL GNLFSLYRIYPVKYGIGSCSKDSGPTVPWYDLGSAKLTASFYSPDFRSQFTPGYIQFRP 

HP FNNERAANALCAGMRVTGCNTEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSS 

. * # * . ** *** # ** # ...**.** *******★★**.*.*****...*..****. 
XL INTEKAALALCPGMKMESCNVEHVCIGGGGYFPEADPRQCGDFAAYDFNGYGTKKFNSAG 

HP REITEAAVLLFYR 
30 *********** 
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Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that s hared a homology of 9 0 % or more ( for 
example, Accession No. R06009) in ESTs, but, since they are 
5 partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

<HP02515> (SEQ ID Nos . 32, 42, and 52) 

10 Determination of the whole base sequence of the cDNA 

insert of clone HP02515 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 176-bp 5 ' -untranslated region, a 690-bp ORF, 
and a 71-bp 3 ' -untranslated region. The ORF codes for a 

15 protein consisting of 22 9 amino acid residues and there 
existed a putative secretory signal at N-terminus and one 
putative transmembrane domain at the C-terminus. Figure 12 
depicts the hydrophobic ity/hydrophilicity profile, obtained 
by the Kyte-Doolittle method, of the present protein. In 

20 vitro translation resulted in formation of a translation 
product of 2 7 kDa that was almost identical with the 
molecular weight of 26,000 predicted from the ORF. In this 
case, the addition of a microsome led to the formation of a 
product of 25.5 kDa from which the secretory signal is 

25 considered to have been cleaved. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from phenylalanine at position 28. 

The search of the protein data base usina the amino 

protein was sunxlar t o zhe human T1/ST2 r eceptor r; inamg 
protein (GenBank Accession No. U41804). Table 7 shows the 
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comparison between amino acid sequences of the human protein 
of the present invention (HP) and the human T1/ST2 receptor 
binding protein (Tl). Therein , the marks of -, *, and . 
represent a gap, an amino acid residue identical with that 
5 of the protein of the present invention, and an amino acid 
residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 55.8% in the entire region. 

10 Table 7 



HP MGDKIV7LPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLE 

* ** *** * ** * * 

.... ■ • 

Tl MMAAGAALALALWLL — MPPVEV-GGAGPPPIQDGEFTFLLPAGRKQCFYQSAPANASLE 

15 HP IEYQVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVE-TEVGDYMFCFDNTFSTISEK 

**** ***** ** * ** * ** * **.******* *★.***..***★.****★** 
• • • ... » 

Tl TEYQVIGGAGLDVDFTLESPQGVLLVSESRKADGVHTVEPTEAGDYKLCFDNSFSTISEK 
HP VIFFELILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLR 

..*****.★ * *.* ** # *.*** *** **..* .*** 

20 Tl LVFFELIFDSL-QDDEEVEGWAEAVEPEEMLDVKMEDIKESIETMRTRLERSIQMLTLLR 

HP AFEARDRNIQESNFDRVNFWSMVNLVVMVVVSAIQVYMLKSLFEDKRKSRT 
******** m ** n * ^ # ****** **..*...*...**. **..*.*** .* 
T 1 AF E ARDRNLQE GNLERVNFWS AVNVAVLLL V AVLQVCTLKRF FQD KRP VPT 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA381943) in ESTs , but, since they 

protein ul tut; present invention, 
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<HP02575> (SEQ ID Nos. 33, 43, and 53) 

Determination of the whole base sequence of the cDNA 
insert of clone HP02575 obtained from cDNA library of human 
5 osteosarcome cell line Saos-2 revealed the structure 
consisting of a 55-bp 5 '-untranslated region, a 1404-bp ORF, 
and a 219-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 4 67 amino acid residues and there 
existed a putative secretory signal at the N-terminus. 

10 Figure 13 depicts the hydrophobic ity /hydrophilicity profile, 
obtained by the Kyte-Doolittle method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 52 kDa that was almost identical with 
the molecular weight of 54,065 predicted from the ORF. In 

15 this case, the addition of a microsome led to the formation 
of a product of 57 kDa which is considered to have a sugar 
chain being attached afetr secretion. In addition, there 
exist in the amino acid sequence of this protein three sites 
at which N-glycosylation may occur (Asn-Arg-Thr at position 

20 171, Asn-Ser-Thr at position 239 and Asn-Asp-Thr at position 
377 ). Application of the (-3,-1) rule, a method for 
predicting the cleavage site of the secretory signal 
sequence, allows to expect that the mature protein starts 
from histidine at position 29. When expressed in COS7 cells, 

25 an expression product of about 55 kDa was observed in the 
supernatant fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the human a -L-f ucos idase CSWISS-PROT 

ammo acxa sequences c: r.ne ::umar: protein o ; me present 
invention (HP) and the human a -L-fucosidase (FC). Therein, 
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the marks of *, and • represent a gap, an amino acid 

residue identical with that of the protein of the present 
invention, and an amino acid residue similar to that of the 
protein of the present invention, respectively. The both 
5 proteins shared a homology of 54.8% in the entire region. 



Table 8 



HP MRPQELPRLAFPLLLLLLLLLPPPPC-PAHSATRFDPTWESLDARQLPAWFDQAKFGIFI 
10 .****★*.* mm # .*... *..*.* ***.*.******.****.** 

FC MRSRPAGPALLLLLLFLGAAESVRRAQPPRRYTPDWPSLDSRPLPAWFDEAKFGVFI 
HP HWGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWAD 
******** ^ # ******* # * * * *.* ** # *****.*.*.*★★ * * * * # * * m m # m ** * 

FC HWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEWAD 
15 HP IFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGL 

*** **** *** ******* * * ***** * ***** ***** . * ** 

FC LFQAAGAKYWLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKR-NIRYGL 
HP YYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWN 
*.**.******.* *....*....* ^** # ****. ***.*.*. m .****. m ★* *** 

20 FC YHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWN 

HP STGFLAWLYNESPVRGTWTNDRWGAGS ICKHGGFYTCSDRYNPGHLLPHKWENCMTIDK 

** ** **** *** ** ***** ****.*.*.*...* * **** * ★ ** 

• •••• •••• •••••• . 

FC STNFLSWL YNDSPVKDEWVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTS IDK 

HP LSWG YRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTIS WFEER RQMGSW 

9f> ****** ** * ***************** .*.*** ** 

4~> KJ • • • • ••••••• • 

FC FSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERu^AVGKW 
HP LKVNGEAI YETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGA 

9 ****** ^ ^ ** * ^ * #- ****** m9 *★****.**..* * *. * ^ 

FC LSINGEAIYASKPWRVQWEKNTTSVWYTSKGSA—VYAIFLHWPENGVLNLESt'ITT- ST 
30 HP TEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVI 
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Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
5 example, Accession No, N28668) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

10 <HP10357> (SEQ ID Nos. 34, 44, and 54) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10357 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 113-bp 
5 '-untranslated region, a 300-bp ORF, and a 54-bp 3 '- 

15 untranslated region. The ORF codes for a protein consisting 
of 99 amino acid residues and there existed two putative 
transmembrare domains. Figure 14 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

20 translation resulted in formation of a translation product 
of 11 kDa that was almost identical with the molecular 
weight of 10,923 predicted from the ORF. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 

25 of sequences that shared a homology of 90% or more (for 
example, Accession No. AA477156) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

< HP .1 u4 4 7 > i SEQ I 7 Nos . ,:. j , 4 d , a na b b } 

Determination of the whole base sequence of the cDNA 



WO 00/05367 



PCT/JP99/03929 



61 

insert of clone HP1044 7 obtained from cDNA library of human 
liver revealed the structure consisting of a 271-bp 5'- 
untranslated region, a 570-bp ORF, and a 34-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
5 of 189 amino acid residues and there existed five putative 
transmembrare domains. Figure 15 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

10 of high molecular weight. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA296976) in ESTs, but, since they 

15 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10477> (SEQ ID Nos . 36, 46, and 56) 

20 Determination of the whole base sequence of the cDNA 

insert of clone HP10477 obtained from cDNA library of human 
liver revealed the structure consisting of a 14 9-bp 5'- 
untranslated region, a 1092-bp ORF, and a 15-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

25 of 363 amino acid residues and there existed one putative 
transmembrane domain at the N-terminus . Figure 16 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
*~ t~ r\ n r 7 * 1 ^ t" 1 ^ ^ >- o <- 1 in f n T~m of ^ ^r^nsl .^t i or product" 



weight ci 39 ,884 predicted from the ORF . 

The search of the protein data base using the amino 
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acid sequence of the present protein revealed that the 
protein was similar to the human peptidoglycan recognition 
protein (GenBank Accession No. AF076483). Table 9 shows the 
comparison between amino acid sequences of the human protein 
5 of the present invention (HP) and the human peptidoglycan 
recognition protein (PG) . Therein, the marks of *, and . 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
residue similar to that of the protein of the present 
10 invention, respectively. The both proteins shared a homology 
of 54.8% in the entire region. 

Table 9 



15 HP MVDSLLAVTLAGNLGLTFLRGSQTQSHPDLGTEGCWDQLSAPRTFTLLDPKASLLTKAFL 

HP NGALDGVILGDYLSRTPEPRPSLSHLLSQYYGAGVARDPGFRSNFRRQNGAALTSASILA 
HP QQVWGTLVLLQRLEPVHLQLQCMSQEQLAQVAANATKEFTEAFLGCPAIHPRCRWGAAPY 

* , a * ** * * ^ 

PG MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALA- 

20 HP RGRPKLLQLPLGFLYVHHTYVPAPPCTDFTRCAANMRSMQRYHQDTQGWGDIGYSFWGS 

m m m m * * ** m * * * ....*.. *..*.** . * ** *.**.*..*. 

PG SECAQHLSLPLRYVWSHT — AG S S CNTP AS CQQQARNVQH YHMKTLGWCDVG YNFL I GE 

HP DGYVYEGRGWHWVGAHTLGH-NSRGFGVAIVGNYTAALPTEAALRTVRDTLPSCAVRAGL 

********* *** * * *** . * * .*.*.... * . * * * 

* •••••••• • * •••••• • • • 

25 PG DGLVYEGRGWNFTGAHSGHLWNPMSIGISFMGNYMDRVPTPQAIRAAQGLL-ACGVAQGA 

HP LRPDYALLGHRQLVRTDCPGDALFDLLRTWPHFTATVKPRPARSVSKRSRREPPPRTLPA 

**••*•* ***.. ** •**..*..*•••***• 

PG LRSNYVLKGHRDVQRTLSPGNQLYHLIQNWPHYRSP 
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of sequences that shared a homology of 90% or more (for 
example, Accession No. AA424759) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
5 protein of the present invention. 

<HP10513> (SEQ ID Nos . 37, 47, and 57) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10513 obtained from cDNA library of human 

10 stomach cancer revealed the structure consisting of a 134-bp 
5' -untranslated region, a 750-bp ORF, and a 0-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 24 9 amino acid residues and there existed one putative 
transmembrane domain at the N-terminus. Figure 17 depicts 

15 the hydrophobic ity / hydro philic it y profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 29 kDa that was almost identical with the molecular 
weight of 27,3 73 predicted from the ORF. 

20 The search of the protein data base using the amino 

acid sequence of the present protein revealed that the 
protein was similar to the human hypothetical protein 
KIAA0512 (GenBank Accession No. AB011084). Table 10 shows 
the comparison between amino acid sequences of the human 

25 protein of the present invention (HP) and the human 
hypothetical protein KIAA0512 (KI). Therein, the marks of -, 
*, and . represent a gap, an amino acid residue identical 
with that of the protein of the present invention, and an 

homology oi 3 1.6% 1:1 the c-temiinai regie:, ci 196 amino acid 
residues . 
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Table 10 



HP MGGPRGAGWAAGLLLGAGACYCIYRLTRGRRRG 

5 

KI RGRGRRPVAMQKRPFPYEIDEILGVRDLRKVLALLQKSDDPFIQQVALLTLSNNANYSCN 
HP DRELGIRSSKSAEDLTDGSYDDVLNAEQLQKLLYLLESTEDPVIIERALITLGNNAAFSV 

* * . * * # * * 

KI QETIRKLGGLPIIANMINKTDPHIKEKAI24AM 
10 HP NQAIIRELGGIPIVANKINHSNQSIKEKALNALNNLSVNVENQIKIKVQVLKLLLNLSEN 

* ... * *..** 

KI LNSAVQWGLKFLTNMTITNDYQHLLVNSIANF — FRLLSQGGGKIKVEILKILSNFAEN 
HP PAMTEGLLRAQVDSSFLSLYDSHVAKEILLRVLTLFQNIKNCLKIEGHLAVQPTFTEGSL 
*.* . **. # ** .★* ***.*.*..***...***★. * . *. * . ..*..*** 
15 KI PDMLKKLLSTQVPASFSSLYNSYVESEILINALTLFEIIYDNLRAE — VFNYREFNKGSL 

HP FFL-LHGEECAQKIRALVDHHDAEVKEKWTIIPKI 
* * * ***** *** ** ** ... * 

■ mm mm mm m m m m m 

KI F YLCTT S GVCVKKI RALANHHDLLVKVKVI KLVNKF 

20 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. N92228) in ESTs, but, since they are 
25 partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

<HP10540> (SEQ ID Nos . 38, 48, and 58) 



osteosarcoma cell . me Saos-2 revealed the structure 
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consisting of a 47-bp 5 ' -untranslated region, a 297-bp ORF, 
and a 245-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 98 amino acid residues and there 
existed two putative transmembrane domains. Figure 18 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
product of high molecular weight. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the Caenorhabditis elegans 
hypothetical protein CEF49C12 .12 ( GenBank Access ion No . 
268227). Table 11 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 
and the c . elegans hypothetical protein CEF4 9C1 2 .12 ( CE ) . 
Therein, the marks of -, *, and . represent a gap, an amino 
acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
of the protein of the present invention, respectively. The 
both proteins shared a homology of 3 6.1% in the entire 
region. 

Table 11 



HP M-ASLLCCGPKLAACGIVLSAWGVIMLIMLGIFFNVHSAVLIEDVPFTEKDFENGPQNIY 
* *** * ★ * **** * ** ** * * * * * 

CE MGKICPLMGPKMSAFCMVMSVWGVIFLGLLGVFFYIQAVTLFPDLHF-EGHGKVPSSVID 
HP NLYEQVSYNCFIAAGLYLLLGGFSFCQVRLNKRKEYMVR 
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Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of seguences that shared a homology of 90% or more (for 
5 example, Accession No. AA420715) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

10 <HP10557> (SEQ ID Nos . 39, 49, and 59) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10557 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 24-bp 
5 '-untranslated region, a 519-bp ORF, and a 130-bp 3'- 

15 untranslated region. The ORF codes for a protein consisting 
of 172 amino acid residues and there existed a putative 
secretory signal at the N-terminus. Figure 19 depicts the 
hydrophobic ity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 

20 translation resulted in formation of a translation product 
of 32 kDa that was larger than the molecular weight of 
18,844 predicted from the ORF. In this case, the addition of 
a microsome led to the formation of a product of 3 9 kDa 
which is considered to have been subjected to some 

25 modification after secretion. In addition, there exist in 
the amino acid sequence of this protein no site at which N- 
glycosylation may occur. Application of the (-3,-1) rule, a 
method for predicting the cleavage site of the secretory 
signal sequence, allows to expect that the mature protein 

^eix^, a:i expressic:. proauci c±. cidoul ...... /w <^ wa^ uuservea ^i. 

the supernatant fraction and the membrane fraction* 
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The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the human progesterone binding 
protein (EM3L Accession No, AJ002030). Table 12 shows the 
5 comparison between amino acid sequences of the human protein 
of the present invention (HP) and the human progesterone 
binding protein (PG) . Therein, the marks of *, and . 

represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 
10 residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 30.5% in the C-terminal region of 151 amino acid residues. 



15 



Table 12 



HP MVGPAP 



PG MAAGDGDVKLGTLGSGSESSNDGGSESPGDAGAAAEGGGWAAAALALLTGGGEMLLNVAL 
HP RRRLRP LAALALVL AL AP GLP TARAGQTPRP AERGP P V — RLFTEEELARYGGEEEDQPI 

20 ** **.. *.. * *. *•* .*•* . * 

PG VALVLLGAYRLWVRWGRRGLGAGAGAGEESPATSLPRMKKRDFSLEQLRQYDG-SRNPRI 
HP YLAVKGWFDVTSGKEFYGRGAPYNALTGKDSTRGVAKMSLDPADLTHDTTGLTAKELEA 

..*.*..**.*...** ..*.. . 

PG LLAVNGKVFDVTKGSKFYGPAGPYGIFAGRDASRGLATFCLDKDALRDEYDDLSDLNAVQ 

25 HP LDEV— FTKVYKAKYPIVGYTARRILNEDGSPNLDFKPEDQPHFDIKDEF 

...* ... .*.** *.*. ... * *.. 

PG MESVREWEMQFKEKY DYVG-RLLKPGEEPS-EYTDEEDTKDHNKQD 



_ -£- ,., , ^-1 *- 1 j 2 - V-v,-,T^^sl^^.fr r-\-f O C\\ r\v m r> >- ^ / f ri>- 
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example, Accession No. AA101709) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

5 

<HP10563> (SEQ ID Nos . 40, 50, and 60) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10563 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 

10 consisting of a 126-bp 5 ' -untranslated region, a 363-bp ORF, 
and a 93 6-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 12 0 amino acid residues and there 
existed two putative transmembrane domains. Figure 2 0 
depicts the hydrophobicity/hydrophilicity profile, obtained 

15 by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
product of 18.5 kDa that was larger than the molecular 
weight of 13,180 predicted from the ORF. 

The search of the protein data base using the amino 

20 acid sequence of the present protein revealed that the 
protein was similar to the Arabidopsis thaliana hypothetical 
protein F27F23.15 (GenBank Accession No. AC003058). Table 13 
shows the comparison between amino acid sequences of the 
human protein of the present invention (HP) and the A. 

25 thaliana hypothetical protein F27F23.15 (AT). Therein, the 
marks of *, and . represent a gap, an amino acid residue 
identical with that of the protein of the present invention, 
and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
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Table 13 



HP MMPSRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIALATVLFLIGAFLI 

* * * ****** * * 

# « • ■ • • • • . . • •• 

AX MAYVDHAFSISDEDLMIGTSY-TVSNRPPVKEISLAVGLLVFGTLGI 
HP IIGSLLLSGYISKGGADRAVPVLIIGILVFLPGFYHLRIAYYASKGYRGySYDDIPDFDD 

^ * ** s *^****. *********.*.*...** 

AT VLGFFMAYNRVG-GDRGHGIFFIVLGCLLFIPGFYYTRIAYYAYKGYKGFSF SNIPS V 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA083574) in ESTs, but, since they 
15 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP01467> (SEQ ID Nos . 61, 71, and 81) 

20 Determination of the whole base sequence of the cDNA 

insert of clone HP01467 obtained from cDNA library of human 
fibrosarcoma cell line HT-1080 revealed the structure 
consisting of a 65-bp 5 ' -untranslated region, a 924-bp ORF, 
and a 447-bp 3 '-untranslated region. The ORF codes for a 

25 protein consisting of 307 amino acid residues and there 
existed three putative transmembrane domains. Figure 21 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
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acid sequence of the present protein revealed that the 
protein was similar to the rat Sec22 homologue (GenBank 
Accession No. U42209). Table 14 shows the comparison between 
amino acid sequences of the human protein of the present 
5 invention (HP) and the rat Sec22 homologue (RN) . Therein, 
the marks of *, and . represent a gap, an amino acid 

residue identical with that of the protein of the present 
invention, and an amino acid residue similar to that of the 
protein of the present invention, respectively. The both 
10 proteins shared a homology of 94.6% in the N-terminal region 
of 241 amino acid residues. The protein Oj. t^e present 
invention was longer by 53 amino acids at the C-terminus 
than the rat Sec2 2 homologue. 

15 Table 14 



HP MSMILSASVIRVRDGLPLSASTDYEQSTGMQECRKYFKMLSRKLAQLPDRCTLKTGHYNI 

** 

RN MSMILSASWRVRDGLPLSASTDCEQSAGVQECRKYFKMLSRKLAQFPDRCTLKTGRHNI 
20 HP NFISSLGVSYMMLCTENYPNVLAFSFLDELQKEFITTYNMMKTNTAVRPYCFIEFDNFIQ 

************************************************************ 
RN NFISSLGVSYMMLCTENYPNVLAFSFLDELQKEFITTYNMMKTNTAVRPYCFIEFDNFIQ 
HP RTKQRYNNPRSLSTKINLSDMQTEIKLRPPYQISMCELGSANGVTSAFSVDCKGAGKISS 
********************** **********.************************** 
25 RN RTKQRYNNPRSLSTKINLSDMQMEIKLRPPYQIPMCELGSANGVTSAFSVDCKGAGKISS 

HP AHQRLEPATLSGIVGFILSLLCGALNLIRGFHAIESLLQSDGDDFNYIIAFFLGTAACLY 
**************,***************************.**.*.************ 
RN AHQRLEPATLSGIVAFILSLLCGALNLIRGFHAIESLLQSDGEDFSYMIAFFLGTAACLY 
HP QCYLLVYYTGWRNVKSFLTFGLICLCNMYLYELRNLWQLFFHVTVGAFVTLQIWLRQAQG 

30 
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Furthermore, the search of the GenBank using ^hu h ise 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA421925) in ESTs, but, since they 
5 are partial sequences, it can not be judged whether or aot 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP01956> (SEQ ID Nos . 62, 72, and 82) 
10 Determination of the whole base sequence of the cDNA 

xnoci u wjl ^j-unc lijc v x ^ u uuuaxucu xjlwul uuiin j.xjJi.ai^ ui IlUiUcLIl 

liver revealed the structure consisting of a 8 6 -hp 5'- 
untranslated region, a 552-bp ORF, and a 359-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

15 of 183 amino acid residues and there existed one pv>"U;'Uve 
transmembrane domain. Figure 22 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein- In vitro 
translation resulted in formation of a tran^lrMnn product 

20 of 20.5 kDa that was almost identical with the raol<o^lar 
weight of 2 0,073 predicted from the ORF. 

The search of the protein data base using the ^r u ino 
acid sequence of the present protein revf\: led t} K v th^ 
protein was similar to the yeast hypothetical protein ^1.5 

25 kDa (SWISS-PROT Accession No. P53073 ). Table 15 srov;. the 
comparison between amino acid sequences of the humav protein 
of the present invention (HP) and the yeasL hypoUietioa} 
protein 21.5 kDa (SC). Therein, the marks of ~, *, and . 
represent a gap, an amino acid residue identical with that 

invention, respectively. The both proteir.b ^r^. j r ^ ^. v 
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of 34.3% in the C-terminal region of 108 amino acid residues. 

Table 15 



5 HP MTAQGGLVANRGRRFKWAIELSGPGGGSRGRSDRGSGQGDSLYPVGYLDKQVPDTS 

SC MSEQEPYEWAKHLLDTKYIEKYNIQNSNTLPSPPGFEGNSSKGNVTRKQQDATSQTTSLA 
HP VQETDRILVEKRCWDIALGPLKQIPMNLFIMYMAGNTISIFPTMMVCMMAWRPIQALMAI 
.* *.*** * * **★*.*. **.*... *.*.* . *. **.*.... 

10 SC QKNQITVLQVQKAWQIALQPAKSIPMNIFMSYMSGTSLQIIPIMTALMLLSGPIKAIFST 

rm f * mr?v \*t t>c* e* c* /-wct r\r>r ttvt TrxiT MPT TIT V_ VPnCM^T.T.DTUIl CTMaTT 7\ T T? "D T> T? 

** * * * * * ***** * *** 

• • • .... . • • . ... • . • 

SC RSAFKPVLGNKATQSQVQTAMFMYIVFQGVLMYIGYRKLNSMGLIPNAKGDWLPWERIAH 
HP RMEFSGGGLLL 



15 



SC YNNGLQWFSD 



Furthermore , the search of the GenBank using the base 
20 sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA159753) in ESTs, but, since they 
are partial sequences , it can not be judged whether or not 
any of these sequences codes for the same protein as the 
25 protein of the present invention. 

<HP02545> (SEQ ID Nos . 63, 73, and 83) 

Determination of the whole base sequence of the cDNA 
insert of clone HP02545 obtained from cDNA library of human 



and ci 6 3 6 - hp 



-untranslated region. The ORF codes tor a 



WO 00/05367 PCT/JP99/03929 

73 



protein consisting of 327 amino acid residues and there 
existed a putative secretory signal at the N-terminus and 
one putative transmembrane domain at the C-terminus . Figure 
23 depicts the hydrophobicity/hydrophilicity profile, 
5 obtained by the Kyte-Doolittle method, of the present 
protein. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the rat embigin (EMBL Accession No. 

10 AJ009698). Table 16 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 
and the rat embigin (RN) . Therein, the marks of *, and . 
represent a gap, an amino acid residue identical with that 
of the protein of the present invention, and an amino acid 

15 residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 65.4% in the entire region. 
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Table 16 



HP MRALPGLLEARARTPRLLLLQCLLAAARPSSADGSAPDSPFTSPPLREEIMAN — NFSLE 

5 RN MRSHTGLRALVAPGCSLLLL-YLLAATRPDRAVGDPADSAFTSLPVREEMMAKYANLSLE 
HP SHNISLTEHSSMPVEKNITLERPSNVNLTCQFTTSGDLNAVNVTWKKDGEQLE — NNYLV 

****** * ******** * * **...* ******* ** 

■ • ••••• • •••• • • « . . • 

RN TYNISLTEQTRVS-EQNITLERPSHLELECTFTATEDVMSMNVTWKKDDALLETTDGFNT 

HP SATGSTLYTQYRFTIINSKQMGSYSCFFREEKEQRGTFNFKVPELHGKNKPLISYVGDST 
10 * *** ***** ****** **** ** ***** ********* * ****** 

RN TKMGDTLYSOYRFTVFNSKOMGKYSCFLGEE — LRGTFNIRVP KVHGKNKPL IT YVGDS T 

HP VLTCKCQNCFPLNWTWYSSNGSVKVPVGVQM-NKYVINGTYANETKLKITQLLEEDGESY 
** * **** ******* *** ** * .* *********** ****** ** 

RN VLKCECQNCLPLNWTWYMSNGTAQVPIDVHVNDKFDINGSYANETKLKVKHLLEEDGGSY 

15 HP WCRALFQLGESEEHIELWLSYLVPLKPFLVIVAEVILLVATILLCEKYTQKKKKHSDEG 
**** * ******** ***** ******* ************** ****** * * 

• . « . • 

RN WCRAAFPLGESEEHIKLWLSFMVPLKPFLAIIAEVILLVAI ILLCEVYTQKKKNDPDDG 
HP KEFEQIEQLKSDDSNGIENNVPRHRKNESLGQ 
*********************** m ** # m * # * 

20 RN KEFEQIEQLKSDDSNGIENNVPRYRKTDSGDQ 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
25 of sequences that shared a homology of 90% or more (for 
example, Accession No. AA312 629) in ESTs , but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

30 

i; t_ e mi maiio;. ^ _ t w aoit- uas e . ■ equen c ^ . . ; r - *: ' u I i n 

insert of clone HP02551 obtained from cDNA Ixbrary of human 
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osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 61 -bp 5 ' -untranslated region, a 672-bp ORF, 
and a 3 84 -bp 3 '-untranslated region. The ORF codes for a 
protein consisting of 223 amino acid residues and there 
5 existed a putative secretory signal at the N-terminus. 
Figure 24 depicts the hydrophobicity/hydrophilicity profile, 
obtained by the Kyte-Doolittle method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 27 kDa that was somewhat larger than 
10 the molecular weight of 24,555 predicted from the ORF, In 
this case, the addition of a microsome led to the formation 
of a product of 26 kDa from which the secretory signal is 
considered to have been cleaved. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
15 secretory signal sequence, allows to expect that the mature 
protein starts from glutamine at position 20. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the mouse FGF binding protein 
20 (GenBank Accession No. U49641). Table 17 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and the mouse FGF binding 
protein (MM). Therein, the marks of -, *, and . represent a 
gap, an amino acid residue identical with that of the 
25 protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention, 
respectively. The both proteins shared a homology of 21.2% 
in the entire region other than the N-terminal region. In 
particular, all the eight cysteine residues contained in the 
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Table 17 



HP MKFVPCLLLVTLSCLGTLGQAPRQKQGST 

• • • • • • ••• 

5 MM MRLHSLILLSFLLLATQAFSEKVRKRAKNAPHSTAEEGVEGSAPSLGKAQNKQRSRTSKS 

HP GEEFHFQTGGRDSCTMRPSSLGQGAGEW7LRVDCRNTDQTYWCEYRGQPSMCQAFAADPK 

.* * * *.*.* ..**.. * . *.*. * . * . 

MM LTHGKFVTKDQATC RWAVTEEEQGISLKVQCTQADQEFSCVFAGDPTDCLKHDKD-Q 

HP SYWNQALQELRRLHHACQGA-PVLRPSVCREAGPQAHMQQVTSSLKGSPEPNQQPEAGTP 
10 **.*. ..**. .. *..* .**..•***. * *... .* — * 

MM T YWKOV ARTTiR KQKNT CR D AKS VTjKTRVCRKRFPE SNLKLVNPNARGNTKPRKEKAEVS A 
HP SLRPKATVKLTEATQLGKDSMEELGKAKPTTRPTAKPTQPGPRPGGNEEAKKKAWEHCWK 
★ * ****... * . **** 

MM REHNKVQEAVSTEPNRIKEDI-TLNPAATQTM-TIRDPECLEDPDVLNQ-RKTALEFCGE 
15 HP PFQALCAFLISFFRG 

• • * • • ••••••• 

MM SWSS ICTFFLNMLQATSC 



20 Furthermore, the search of the GenBank using the base 

sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA317400) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 

25 any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP02631> (SEQ ID Nos. 65, 75, and 85) 

Determination of the whole base sequence of the cDNA 



"insisting c: a 4/-bp [< 1 -untransiatea region, <; 147-bp ORF , 
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and a 1191-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 48 amino acid residues and there 
existed a putative secretory signal at the N-terminus. 
Figure 25 depicts the hydrophobicity/hydrophilicity profile, 
5 obtained by the Kyte-Doolittle method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 10 kDa or less. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
10 of sequences that shared a homology of 90% or more (for 
example, Accession No. AA156969) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

15 

<HP02632> (SEQ ID Nos . 66, 76, and 86) 

Determination of the whole base sequence of the cDNA 
insert of clone HP02632 obtained from cDNA library of human 
fibrosarcoma cell line HT-1080 revealed the structure 

20 consisting of a 50-bp 5 ' -untranslated region, a 1116-bp ORF, 
and a 337-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 371 amino acid residues and there 
existed eight putative transmembrane domains. Figure 26 
depicts the hydrophobicity/hydrophilicity profile, obtained 

25 by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
product of high molecular weight. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed t hh t the 

i.ypotnciicd.i pi.oL.exi. v^LC-ih ^ ■ -eribariK njcessic;:. s . 
Table 18 shows the comparison between amino acid r » 
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of the human protein of the present invention (HP) and the C. 
elegans hypothetical protein CELC2H12 (CE). Therein, the 
marks of *, and . represent a gap, an amino acid residue 
identical with that of the protein of the present invention, 
5 and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 51.4% in the entire region. 



Table 18 

10 - 

HP MAWTKYQLFLAGLMLVTGS INTLSAKWADNFMAEGCGGSKEHSFQHPFLQAVGMFLGEFS 
*.*★**.**. .*****.. .*. . ^* # ****** # ** # *** 

CE MVAFAVI I S VMMWTGS LNTI CAKWADS IKAD GVPFNHPFLQATCMFFGEFL 

HP CLAAFYL LRCRAAGQSDS SVDPQQPFNPLLFLPPALCDMTGTSL 

15 **..*.* * ...*.*.* . . .****.**.******. ***. 

CE CLWFFLIFGYKRYVWNRANVQGESGSVTEITSEEKPTLPPFNPFLFFPPALCDILGTS I 
HP MYVALNMTSASSFQMLRGAVIIFTGLFSVAFLGRRLVLSQWLGILATIAGLVWGLADLL 

** # .** # * # it****************.**..* . .*•*.* • • ***.**..*. 

CE MYIGLNLTTASSFQMLRGAVIIFTGLLSVGMLNAQIKPFKWFGMLFVMLGLVIVGVTDIY 
20 HP SKHDSQHKLSEVITGDLLIIMAQIIVAIQMVLEEKFVYKHNVHPLRAVGTEGLFGFVILS 
* **★ *** *********** *.*.. *..*..* *** ******** 
CE YDDDPLDDKNAIITGNLLIVMAQIIVAIQMVYEQKYLTKYDVPALFAVGLEGLFGMVTLS 
HP LLLVPMYYIPAG-SFSGNPRGTLEDALDAFCQVGQQPLIAVALLGNISSIAFFNFAGISV 
.* #- *.***. b # **.** * ***.. * * **.** * # . *********.** 

25 CE ILMIPFYYIHVPRTFSTNPEGRLEDVFYAWKEITEEPTIALALSGTVVSIAFFNFAGVSV 

HP TKELSATTRMVLDSLRTWIWALSLALGWEAFHALQILGFLILLIGTALYNGLHRPLLGR 

************** # ** m *** # 9 * m # * * * * # * # ** .*..** .**.. 
CE TKELSATTRMVLDSVRTLVIWWSIPLFHEKFIAIQLSGFAMLILGTLIYNDILIGPWFR 
HP LSRGRPLAEESEQERLLGGTRTPINDAS 
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Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. N50907) in ESTs, but, since they are 
5 partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

<HP10488> (SEQ ID Nos . 67, 77, and 87) 

10 Determination of the whole base sequence of the cDNA 

insert of clone HP10488 obtained from cDNA library of human 
liver revealed the structure consisting of a 39-bp 5'- 
untranslated region, a 273-bp ORF, and a 421-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

15 of 90 amino acid residues and there existed one putative 
transmembrane domain at the N-terminus. Figure 27 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

20 of 10 kDa that was almost identical with the molecular 
weight of 10,151 predicted from the ORF. When expressed in 
COS7 cells, an expression product of about 6 kDa was 
observed in the membrane fraction. 

Furthermore, the search of the GenBank using the base 

25 sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. H73534) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these seauences codes for the same protein as the protein 

<HP10538> (SEQ ID Nos. 68, 78, and 88) 
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Determination of the whole base sequence of the cDNA 
insert of clone HP10538 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 357-bp 5 ' -untranslated region, a 1500-bp ORF, 
5 and a 1911-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 499 amino acid residues and there 
existed at least four putative transmembrane domains. Figure 
28 depicts the hydrophobic it y/hydrophilicity profile, 
obtained by the Kyte-Doolittle method, of the present 

10 protein. In vitro translation resulted in formation of a 
translation product of high molecular wexght. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the mouse pore-forming K + channel 

15 subunit (GenBank Accession No. AF056492). Table 19 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and the mouse pore-forming K + 
channel subunit (MM). Therein, the marks of -, *, and . 
represent a gap, an amino acid residue identical with that 

20 of the protein of the present invention, and an amino acid 
residue similar to that of the protein of the present 
invention, respectively. The both proteins shared a homology 
of 32.4% in the N-terminal region of 241 amino acid residues. 
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Table 19 



HP MVDRGPLLTSAIIFYLAIGAAIFEVLEEPHWKEAKKNYYTQKLHLLKEFPCLGQEGLDK 
* . ...**. ** .*••**.** -.*.*. • ..*.. **..*..*.. 

5 MM MRSTTLIJ^LLALVLLYLVSGALVFQALEQPHEQQAQKKMDHGRDQFLRDHPCVSQKSLED 

HP I LEWS DAAGQG VAITGNQTFNITWNWPNAMIFAATVITTIGYGNVAPKTPAGRLF 

* * * ** .*..*..*.********.. .* ***** 

MM FIKLLVEALGGGANPETSWTNSSNHSSAWNLGSAFFFSGTIITTIGYGNIVLHTDAGRLF 
HP CVFYGLFGVPLCLTWISALGKFFGGRAKR LGQFLTKRGVSLRKAQITCTVIFIVWG 

10 *.**.* ....*. .*.. .* *. * *.*.. * 

MM CIFYALVGIPLFGMLLAGVGDPT.GSSLRRGIGHIEAIFLKWHVPPGLVRSLSAVLFLLIG 
HP VLVHLVIPPFVFMVTEGWNYIEGLYYSFITISTIGFGDFVAGVNPSANYHALYRYFVELW 

*• ...*.*** .*..*. ..*..*.****-*-* *. •* •* 

MM CLLFVLTPTFVFS YMESWSKLEAIYFVIVTLTTVGFGDYVPG-DGTGQNSPAYQPLVWFW 
15 HP I YLGLAWLS LF VNWKVS MF VE VHKAI KKRRRRRKE S FE S S PHS RKALQVKGS TAS KD VNI 

* .***... 

MM I LFGLAYF AS VLTT I GNWLRAVS RRTRAEMGGLTAQAAS WTGTVTARVTQRTGP S AP PP E 



20 Furthermore, the search of the GenBank using the base 

sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No* R25184) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 

25 of these sequences codes for the same protein as the protein 
of the present invention. 

<HP10542> (SEQ ID Nos. 69, 79, and 89) 

Determination of the whole base sequence of the cDNA 

on ; r- c~, r +• - * -i r>nn up 10^4*? ^ h +- n i ^ o ^ fror 1 °DNA 1 ibrarv n f human 

: J -untranslated r eg ion , ; 3 2 1-tn: ORF , and ^ 4 26-np i 11 - 
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untranslated region* The ORF codes for a protein consisting 
of 10 6 amino acid residues and there existed one putative 
transmembrane domain. Figure 29 depicts the 
hydrophobic it y/hydrophilicity profile, obtained by the Kyte- 
5 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 12 kDa that was almost identical with the molecular 
weight of 11,724 predicted from the ORF • When expressed in 
COS7 cells, an expression product of about 13 kDa was 

10 observed in the membrane fraction. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA029683) in ESTs, but, since they 

15 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10571> (SEQ ID Nos. 70, 80, and 90) 

20 Determination of the whole base sequence of the cDNA 

insert of clone HP10571 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 95-bp 
5 '-untranslated region, a 459-bp ORF, and a 675-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

25 of 152 amino acid residues and there existed one putative 
transmembrane domain. Figure 30 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

a microsome led to the formation of a product of 23 kDa 
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which is considered to have a sugar chain being attached 
after secretion. In addition, there exists in the amino acid 
sequence of this protein one site at which N-glycosylation 
may occur (Asn-Ile-Thr at position 10). 
5 Furthermore, the search of the GenBank using the base 

sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA105822) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
10 any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP01470> (SEQ ID Nos . 91, 101, and 111) 

Determination of the whole base sequence of the cDNA 

15 insert of clone HP01470 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 157-bp 
5 '-untranslated region, a 1077-bp ORF, and a 385-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 35 8 amino acid residues and there existed one putative 

20 transmembrane domain. Figure 31 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 43 kDa that was somewhat larger than the molecular weight 

25 of 40,489 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of 4 0 kDa 
from which the secretory signal is considered to have been 
cleaved and a product of 43.5 kDa which is considered to 
have been subjected to some modification. Application of the 

t ne secre to im- 
mature protein 



1 gna. s eauence, aiiows ... ^ x p e c i. i.. n a l l n e 
starts from glycine at position 23. When 
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expressed in COS7 cells, an expression product of about 44 
kDa was observed in the supernatant fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
5 protein was similar to the Caenorhabditis elegans 
hypothetical protein 39.9 kDa (SWISS-PROT Accession No. 
Q10005). Table 20 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 
and the C. elegans hypothetical protein 39.9 kDa (CE). 

10 Therein, the marks of -, *, and . represent a gap, an amino 
acid residue identical with that of the protein of the 
present invention, and an amino acid residue similar to that 
of the protein of the present invention, respectively. The 
both proteins shared a homology of 58.9% in the entire 

15 region. 
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Table 20 



HP MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDD 

* ********** # ..* . ..********* .******.** 
5 CE MRILNVSLLVLASSLVAFVECGRDFYKILGVAKNANANQIKKAYRKLAKELHPDRNQDD 

HF PQAQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGL— KDGHQSSHGDIFSHFFGDFGFMFG 
* b *★★***, .*******.*** * * # * * * * . . . * . . * * * ***** * * 
CE EMANEKFQDLSSAYEVLSDKEKRAMYDRHGEEGVAKMGGGGGGGHDPFSSFFGDF-FG-G 
HP GTPRQQDRNIPRGSDIIVDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTT 

in * * * * ** *******.*.***. *.*.* .*..*.*.****.****. 

J.V./ •• •••••••• 

~ ^«»TTf»T^TT-PTTT?TVT)vvTi TivynTcmnDnrMPDurMDipr 

CE GGGHGGEEC5TFKUAUVTXUijir'V'rijr.r.v x tNvjnr vi,iiuviu^» * w^w*^******^*- 

HP QLGPGRFQMTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGD 
*.*.**★** * ***********.*...****.* *• .* . * **★***..*.*★* 
CE QMGQGRFQMFQVKVCDECPNVKLVQENKVLEVEVEVGADNGHQQIFHGEGEPHIEGDPGD 
15 HP LRFRIKWKHPIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAK 

*.*.*.. *** **************** ****.* ***** *.. *★*.*,***. 

CE LKFKIRIQKHPRFERKGDDLYTNVTISLQDALNGFEMEIQHLDGHIVKVQRDKVTWPGAR 
HP LWKKGEGLPNFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQ-KVYNGLQG 

*.**.**.*....** ** *..****.***..*..* * ..*.*..*. *.**** 

20 CE LRKKDE GMP SLEDNNKKGMLWTFDVEFPKTELSDEQKAQ I IE ILQQNTVKPKAYNGL 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
25 example, Accession No. AA282838) in ESTs , but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

on ^u-p^n? 4 1 o-> f cT?rs rn Mn^ a ? r 102, and 112) 

.•..risers ct clone H?02 4 r> cDtainea rrom c.Dlh\ library c; Human 
stomach cancer revealed the structure consisting of a 253-bp 
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5 '-untranslated region, a 681-bp ORF, and a 1120-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 226 amino acid residues and there existed four putative 
transmembrane domains. Figure 32 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of high molecular weight. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the human hypothetical protein 
KIAA0108 (SWISS-PROT Accession No. Q15012). Table 21 shows 
the comparison between amino acid sequences of the human 
protein of the present invention (HP) and the human 
hypothetical protein KIAA0108 (KI). Therein, the marks of -, 
*, and . represent a gap, an amino acid residue identical 
with that of the protein of the present invention, and an 
amino acid residue similar to that of the protein of the 
present invention, respectively. The both proteins shared a 
20 homology of 43.9% in the entire region. 



10 



15 



WO 00/05367 



PCT/JP99/03929 



87 



Table 21 



HP MKMVAPWTRFYSNSCCLCCHVRTGTILLGVWYLIINAVVLLILLSALADPD QY 

***★****★.**.**...* 

5 KI MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMWNLLMAILLTVEVTHPNSMPAV 

HP NFSSSELGGDFEF-MDDANMCIAIAISLLMILICAMATYGAYKQRAAWIIPFFCYQIFDF 

KI NIQYEVIGNYYSSERMADNACVLFAVSVLMFIISSMLVYGAISYQVGWLIPFFCYRLFDF 
HP ALNMLVAITVLIYPNS IQE YIRQLPPNFP YRDDVMSVNPTCLVLI ILLFIS I ILTFKGYL 
10 ****. *.* .*.**. ** *.**★.** **..*.*.* **.★* 

KT VT.SCT.VATSSLTYLPRIKEYLDQL-PDFPYKDDLLALDSSCLLFIVLVFFAT.FTTFKAYL 
HP I S CVWNC YRYINGRNS SDVLVYVT- SNDTTVLLPP YDDATVNGAAKEPPPP YVS A 

KI INCVTVNCYKYINNRNVPEIAVYPAFEAPPQYVLPTY-EMAVKMPEKEPPPPYLPA 

15 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
20 example, Accession No. AA173214) in ESTs , but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

25 <HP02631> (SEQ ID Nos. 93, 103, and 113) 

Determination of the whole base sequence of the cDNA 
insert of clone HP02 631 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 42-bp 5 ' -untranslated region, a 588- bp ORF, 

this coaon encodes selenocysteme iron: the iuoiecu.i ■.■ 1 . ■ . ght 
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of the translation product and the sequence comparison data 
with the caenorhabditis elegans homologue. The ORP codes for 
a protein consisting of 195 amino acid residues and there 
existed a putative secretory signal at the N-terminus and 
5 one putative transmembrane domain in the intermediate region. 
Figure 33 depicts the hydrophobicity/hydrophilicity profile, 
obtained by the Kyte-Doolittle method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 58 kDa. in this case, the addition of 
10 a microsome led to the formation of a product of 56 kDa from 
which the secretory signal is considered to have been 
cleaved. Since both of these products are larger than the 
molecular weight of 22 kDa predicted from the ORF, it is 
likely that the protein interacts with another protein. 
15 The search of the protein data base using the amino 

acid sequence of the present protein revealed that the 
protein was similar to the Caenorhabditis elegans 
hypothetical protein C35C5.3 (EMBL Accession No. Z78417). 
Table 22 shows the comparison between amino acid sequences 
20 of the human protein of the present invention (HP) and the C. 
elegans hypothetical protein C35C5.3 (CE). U at position 49 
in the amino acid sequence of the protein of the present 
invention represents selenocysteine . Therein, the marks of -, 
*, and . represent a gap, an amino acid residue identical 
25 with that of the protein of the present invention, and an 
amino acid residue similar to that of the protein of the 
present invention, respectively. The both proteins shared a 
homology of 37.9% in the entire region other than the N- 
i-*rminal reaion. Cystein was found in the sequence of the C. 

4 0 cncoaea r-y tne s:,op coaor. .'tiiU!:;;jyi^—- 
of the present invention. 
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Table 22 



Hp MRLLLL 

CE MRIHDELQKQDMSRFGVFIIGVLFFMSVCDVLRTEEHSHDENHVHEKDDFEAEFGDETDS 
HP LLVAAS AMVRS EAS ANLGGVP S KRLKMQ YATGPLLKFQI CVSUGYRRVFEE YMRVISQRY 

* * . . *** **...* * 

CS QSFSQGTEEDHIEVREQSSFVKPTAVHHAKDLPTLRIFYCVSCGYKQAFDQFTTFAKEKY 
HP PDIRIEGENYLPQP IYRHI AS FLSVFKLVLIGLI IVGKDPFAFFGMQAPS IWQWGQENKV 
* . . „ ***.*. * ** *.... * .■**. **• * * * ...**. 

HP YACMMVFFLSNMIENQCMSTGAFEITLNDVPVWSKLESGHLPSMQQLVQILDNEMKLNVH 
b ★* # ** # * . * m . * , ..***.***..** *...*..*... . 

CE S S CMLVFMLGNLVEQS LI S TGAFEVYLGNEQIWSKI ES GRVPS PQEFMQLIDAQLAVLGK 
15 HP MDSIPHHRS 

CE APVNTESFGEFQQTV 



10 



20 Furthermore, the search of the GenBank using the base 

sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA156969) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 

25 any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP02695> (SEQ ID Nos . 94, 104, and 114) 

Determination of the whole base sequence of the cDNA 

S ' -untrans 1 a tea regioiw ^ lCJ20-bp ORF , ana u 160-bp 2'- 
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untranslated region. The ORF codes for a protein consisting 
of 33 9 amino acid residues and there existed three putative 
transmembrane domains. Figure 34 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

5 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 38 kDa that was almost identical with the molecular 
weight of 38,274 kDa predicted from the ORF. 

The search of the protein data base using the amino 

10 acid sequence of the present protein revealed that the 
protein was similar to the rat hypertension- induced protein 
S-2 fragment (PIR Accession No. 539959). Table 23 shows the 
comparison between amino acid sequences of the human protein 
of the present invention (HP) and the rat hypertension- 

15 induced protein S-2 fragment (RN). Therein, the marks of -, 
*, and . represent a gap, an amino acid residue identical 
with that of the protein of the present invention, and an 
amino acid residue similar to that of the protein of the 
present invention, respectively. The both proteins shared a 

20 homology of 74.3% in the entire region. 
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Table 23 



HP MNWELLLWLLVLCALLLLLVQLLRFLRADGDLTLLWAEWQGRRPEWELTDMVVWVTGASS 

5 HP GIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDIIiVLPLDLTDTGSHEA 

*N VKRRSLENGNLKEKDILVLPLDLAP'XSSHDT 
HP ATKAVLQEFGRIDILVNNGGMSQRSLCMDTSLDVYRKLIELNYLGTVSLTKCv^,; .u.IER 

10 RN ATKTVLQEFGRIDILVNNGGVAHASLVENTNMDIFKVLIEVNYLGTVSLTKCFLPHMMER 

HP KQGKIVTVNSILGIISVPLSIGYCASKHALRGFFNGLRTELATYPGTTV^MTrpr.mrncxT 

— — — ■ w*. * ^Uil 

RN NQGKIWMKS 



15 



20 



25 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. T84331) in ESTs, but, since they arc- 
partial sequences, it can not be judged whether or k>1 any 
of these sequences codes for the same protein as che ( .<>-o : - >. 
of the present invention. 

<HP10031> (SEQ ID Nos. 95, 105, and 115) 

Determination of the whole base sequence or t)r crm- 
insert of clone HP10031 obtained from cDNA library co Human 
osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 55-bp 5 ' -untranslated region, a 1^64- bp ORF, 
and a 649-bp 3 ' -untranslated region. The ORF rodes, for a 



cieDlCtS the n vH rnnhnH i r> i t- u / h^n i-r^h ^ - , ^^^, r _ 

t - - - J — z -—^>--i-vjf / iij^j. w^nxx^^i {JJ 
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by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
product of high molecular weight. When expressed in COS7 
cells , an expression product of about 55 kDa was observed in 
5 the membrane fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the Caenorhabditis elegans 
hypothetical protein CELK07H8 (GenBank Accession No. 

10 AF047659). Table 24 shows the comparison between amino acid 
sequences of the human protein of the present invention (HP) 
and the c. elegans hypothetical protein CELK07H8 (CE). 
Therein, the marks of -, *, and . represent a gap, an amino 
acid residue identical with that of the protein of the 

15 present invention, and an amino acid residue similar to that 
of the protein of the present invention, respectively. The 
both proteins shared a homology of 44.2% in the entire 
region. 
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Table 24 



HP MDGTETRQRRLD3CGKPGELGLPHPLSTGGLPVAS 

5 CE MKGGGGIGDGKKDYQSAVKEGLTTFDQLGIALEDVGKSMDAETATPGGSLFSRVIFRFRN 

HP EDGALRAPESQSVTPKPLETEPSRETAWSIGLQVTVPFMFAGLGLSWAGMLLDYFQHWPV 

* * ★ ** ** **** ** * *★ 

«.*•••• ■ • • • ••••• • 

CE ENSSLKSRTYDHSNDLVNMSVIPAESSYVLFFQVLFPFAVAGLGM^/FAGLVLSIWTWPL 

HP FVEVKDLLTLVPPLVGLKGNLEMTLASRLSTAANTGQIDDPQEQHRVISSNLALIQVQAT 

20 * *. ..*.***.*.**************** ** *..* *. ,**★*.★**** 

(J£ £'££l±'J£XJj±ljV^AljljUJLJ^iNijiM^ v lAiMUiiijvyvyAi 

HP WGLIJ^VAALLLGWSREEVDVAKVELLCA 

**..**,. * * * .*.****. ** *...*..*** ** 

CE WAFLASAFAAALAFIPSGDFDWAHGALMCASSLATACSASLVLSLLMVVVIVTSRKYNI 
15 HP NPDNIATP IAASLGDLITLS ILALVSSFFYR-HKDSRYLTPLVCLSFAALTPVWVLIAKQ 

****.***********.**. # ** . .* * . * *. .* . * * * *. **.. 

CE NPDNVATPIAASLGDLTTLTVLAFFGSVFLKAHNTESWLNVIVIVLFLLLLPFWIKIANE 
HP SPPIVKILKFGWFPIILAMVISSFGGLILSKTVSKQQYKGMAIFTPVICGVGGNLVAIQT 

* ★* * * * * * * ** ** * * * * ****** * * 

, # " " • •• • » • • • ••••••• • • • • 

20 CE NEGTQETLYNGWTPVIMSMLISSAGGFILETAV — RRYHSLSTYGPVLNGVGGNLAAVQA 

HP SRISTYLHMWSAPGVLPLQ — MKKFWPNPCSTFCTSEINSMSARVLLLLWPGHLIF-FY 
**^***^* # ## **** 9 .*„*************. * * 

CE SRLSTYFHKAGTVGVLPNEWTVSRF-TSVQRAFFSKEWDSRSARVLLLLWPGHICFNFL 
HP I-IYLVEGQSVINSQ — TF WL YLLAGL I QVT I LL YLAEVMVRLTWHQ ALDP DNH C I P YL 

or; * * ** * *** *** * * * **** ★*** 

£0\j ••••••••••• • •••• • • • 

CE IQLFTLTSKNNVTPHGPLFTSLYMIAAIIQWILLFVCQLLVALLWKWKIDPDNSVIPYL 
HP TGLGDLLGTGLLALCFFTDWLLKSKAELGGISELASGPP 

* ? ********** 0 *.* 
CE TALGDLLGTGLLFIVFLTTDHFDPKELTSS 

30 



seauences oi t ne present cDNA has revealea tne registration 
of sequences that shared a homology of 90% or more (for 
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example, Accession No. AA334000) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

5 

<HP10530> (SEQ ID Nos . 96, 106, and 116) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10530 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 

10 consisting of a 80-bp 5 ' -untranslated region, a 1182-bp ORF, 
and a 95-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 393 amino acid residues and there 
existed a putative secretory signal at the N-terminus . 
Figure 36 depicts the hydrophobicity/hydrophilicity profile, 

15 obtained by the Kyte-Doo little method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 4 6 kDa that was somewhat larger than 
the molecular weight of 44,912 predicted from the ORF. In 
this case, the addition of a microsome led to the formation 

20 of a product of 4 5.5 kDa from which the secretory signal is 
considered to have been cleaved. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from lysine at position 23. When expressed in 

25 COS7 cells, an expression product of about 43 kDa was 
observed in the supernatant fraction and the membrane 
fraction. 

The search of the protein data base using the amino 

acid seauence of the present protein revealed that the 

protein IGGC2N0i iGenBanK Accession No . AFUG7269). laoj-e u 
shows the comparison between amino acid sequences of the 
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human protein of the present invention (HP) and the A, 
thaliana hypothetical protein IG002N01 (AT). Therein, the 
marks of *, and . represent a gap, an amino acid residue 
identical with that of the protein of the present invention, 
5 and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 27.0% in the N-terminal region of 355 
amino acid residues . 
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Table 25 



HP 

MRTLFNLLWL 

5 AT MELTSFQKSPSSNDWSFSVSLVRNSMARRRRSSAAESLKRRNDGYESLCQWQQDSDRR 

HP ALACSPVHTTLSKSDAKKAAS KTLLEKSQFSDKPVQDRGLVVTDLKAESVVLEHRSYCSA 

AT LITIFVIFFIVIPAVSIAVYKVKFADRVIQTESSIRQKGIVKTDINFQEILTEHSK AS 

HP KARDRHFAGDVLGYVTPWNSHGYDVTKVFGSKFTQISPVWLQ-LKRRGREMFEVTGLHDV 
10 -...**.. **.*.♦* ..* * *.* *..**... . ** 

• • • • • 

AT ENSTRHYDYPVLAYITP— CQGSGL— VLEGR-HNADKGWIQELRSRGNALSASKGLPKL 
HP DQGWMRAVRKHAKGLHIVPRLLFEDWTYDDFRNVLDSEDEIEELSKTWQVAKNQHFDGF 

' ' • * ••• • * • ** .*.* *.* 

AT — YNSCIFHALKRMNFFTLELVNFNTYLVIMFALNS-REMEYNGIVLESWSRWAAYGVL 
15 HP VVEW7NQLLSQKRVGLIHMLTHLAEALHQARLLALLVIPPAITPGTDQLGMFTHKEFEQL 

**• * * • * ............ . • • ■ * . *•••••«* 

AT HDPDLRKMALKFVKQLGDALHSTSSPRNNQQHMQFMYWGPPRSEKLQMYDFGPEDLQFL 

HP APVLDGFSLMTYDYSTAHQPGPNAPLSWVRACVQ-VLDPKSK WRSKILLGLNFYGM 

•*********.*....******..*. .* .*.***.**♦* 

20 AT KDSVDGFSLMTYDFSNPQNPGPNAPVKWIDLTLKLLLGSSNNIDSNIARKVLLGINFYGN 
HP DYATSKDAREPWGARYIQTLKDHRPRMVWDSQASEHFFEYKKSRSGRHWFYPTLKSLQ 
*•'•* * *■• *..*.* . ** **.* * *.******.♦. 

IL 



AT DFVISGGGGGAITGRDYLALLQKHKPTFRWDKESGEHLFMYRDDKNIKHAVFYPTLMS 



HP VRLELARELGVGVS IWELGQGLDYFYDLL 
25 . *** ** *.*.****.**. 

AT LRLENARLWGIGISIWEIGQDKGHFGKYAEASLEASSIFSGHTFDMQFRTNPRQLSRNGS 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
30 of sequences that shared a homology of 90% or more (for 
example, Accession No. AA302913) in FSTs . bu+- 



sequences roaes 



protein 
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protein of the present invention. 

<HP10541> (SEQ ID Nos . 97, 107, and 117) 

Determination of the whole base sequence of the cDNA 
5 insert of clone HP10541 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 7-bp 
5 '-untranslated region, a 591-bp ORF, and a 113-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 196 amino acid residues and there existed a putative 

10 secretory signal at the N-terminus . Figure 37 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 23 kDa that was somewhat larger thc:n the molecular weight 

15 of 21,553 predicted from the ORF. In this case, the addition 
of a microsome led to the formation of a product of 20 kDa 
from which the secretory signal is considered to have been 
cleaved and a product of 23 kDa which is considered to have 
a sugar chain being attached. Application of the (-3,-1) 

20 rule, a method for predicting the cleavage site of the 
secretory signal sequence, allows to expect that the mature 
protein starts from glycine at position 41 . In addition, 
there exists in the amino acid sequence of this protein one 
site at which N-glycosylation may occur (Asn-Leu-Thr at 

25 position 185 ) . 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the human zymogen membrane protein 
(GenBank Accession No. AF056492). Table 26 shows the 



the t resent .invention (HP) ana cne numan zymogen membrane 
protein (ZM). Therein, the marks of *, and . represent a 
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gap, an amino acid residue identical with that of the 
protein of the present invention, and an amino acid residue 
similar to that of the protein of the present invention, 
respectively. The both proteins shared a homology of 37.6% 
5 in the C-terminal region of 133 amino acid residues. 



Table 2 6 



HP MWRVPGTTRRPVTGESPGMHRPEAMLLLLTLALLGGPTWAGKMYGPGGGKYFS-TTEDYD 
10 *★ .... * 

ZM MLTVALLALLCASASGNAI QARS S S YS GE YGS GGGKRFSHS GNQLD 

HP HEITGLRVSVGLLLVKSVQVKLGDSWDVKLGALGGNTQEVTLQPGEYITKVFVAFQAFLR 

ZM GPITALRVRVNTYYIVGLQVRYGKVWSDYVGGRNGDLEEIFLHPGESVIQVSGKYKWYLK 
15 HP GMVMYTSKDRYFYFGKLDGQI SS AYPSQEGQVLVGIYGQYQLLGIKSIGFEWN-YPLEEP 

# * # * # * # * * # *** m ft # * * # a ** * * . * *..**..*. * * 

ZM KLVFVTDKGRYLSFGKDSGTSFNAVPLHPNTVLRFISGRSGSL-IDAIGLHWDVYPTSCS 
HP TTEPPVNLTYSANSPVGR 



20 ZM RC 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
25 of sequences that shared a homology of 90% or more ( for 
example, Accession No. AA340605) in ESTs , but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 



NA 
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insert of clone HP10550 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 241-bp 
5 '-untranslated region, a 324-bp ORF, and a 86-bp 3'- 
untranslated region. The ORF codes for a protein consisting 

5 of 107 amino acid residues and there existed one putative 
transmembrane domain. Figure 38 depicts the 
hydrophobic ity/hydrophilicity profile, obtained by the Kyte- 
Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

10 of high molecular weight. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA348310) in ESTs, but, since they 

15 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10590> (SEQ ID Nos. 99, 109, and 119) 

20 Determination of the whole base sequence of the cDNA 

insert of clone HP10590 obtained from cDNA library of human 
fibrosarcoma cell line HT-1080 revealed the structure 
consisting of a 77-bp 5 ' -untranslated region, a 1053-bp ORF, 
and a 180-bp 3 ' -untranslated region. The ORF codes for a 

25 protein consisting of 350 amino acid residues and there 
existed one putative transmembrane domain. Figure 3 9 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doo little method, of the present protein. In vitro 
translation resulted in formation of a translation product 

addition of a microsome led to the formation of a product o£ 
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43 kDa which is considered to have a sugar chain being 
attached. In addition, there exist in the amino acid 
sequence of this protein two sites at which N-glycosylation 
may occur (Asn-Asn-Ser at position 144 and Asn-Leu-Thr at 
5 position 328) . 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA461346) in ESTs, but, since they 
10 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10591> (SEQ ID Nos. 100, 110, and 120) 

15 Determination of the whole base sequence of the cDNA 

insert of clone HP10591 obtained from cDNA library of human 
fibrosarcoma cell line HT-1080 revealed the structure 
consisting of a 232-bp 5 ' -untranslated region, a 324-bp ORF, 
and a 844-bp 3 ' -untranslated region. The ORF codes for a 

20 protein consisting of 107 amino acid residues and there 
existed one putative transmembrane domain. Figure 4 0 depicts 
the hydrophobicity/hydrophilicity profile, obtained by the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 

25 of 12 kDa that was almost identical with the molecular 
weight of 11,328 predicted from the ORF. 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 9 0% or more ( for 

Dartiai sequences, :. can not. ne judqeci wnetner ci. not. any 
of these sequences codes for the same protein as the protein 
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of the present invention. 

<HP01462> (SEQ ID Nos . 121, 131, and 141) 

Determination of the whole base sequence of the cDNA 
5 insert of clone HP01462 obtained from cDNA library of huuan 
fibrosarcoma cell line HT-1080 revealed the struck ce 
consisting of a 121-bp 5 '-untranslated region, a 1452-bp ORF, 
and a 477-bp 3 '-untranslated region. The ORF codes fo. a 
protein consisting of 483 amino acid residues and there 

10 existed a putative secretory signal at the N- terminus. 
Figure 41 depicts the hydrophobicity/hydrophilicity profile, 
obtained by the Kyte-Doolittle method, of the present 
protein. In vitro translation resulted in formation of a 
translation product of 72 kDa that was larger than the 

15 molecular weight of 55,838 predicted from the 0<t'- 
Application of the (-3,-1) rule, a method for predict; nc,' the 
cleavage site of the secretory signal sequence, allows to 
expect that the mature protein starts from lysine at 
position 21. 

20 The search of the protein data base using t'ncr amino 

acid sequence of the present protein revealed f!vji; the 
protein was similar to the Caenorhabditis elegans 
hypothetical protein ZK1058.4 (EMBL Accession No ■ ■ ib*r»i). 
Table 27 shows the comparison between amino acid - -u mc-- 

25 of the human protein of the present invention (HP, \;.o the 

elegans hypothetical protein ZK1058.4 (CE). Therein, the 
marks of -, *, and . represent a gap, an amine; acirf residue 
identical with that of the protein of the present invention, 
and an amino acid residue similar to that of the pr.ote.in c,f. 



• :.ci r e u ^ -. n oir.c- 1 o q y u ... 
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HP MKAFHTFCWLLVFGSVSEAKFDDFEDEEDIVEYDDNDFAEFEDVMEDSVTESPQRVIIT 

* * 

5 CE MKIVWIFLIFFIGFAIST 

HP EDDE-DETTVELEGQDENQEGDFEDADTQEGDTESEPYDDEEFEGYEDKP D 

.*.* • * . * . * . .... •••*•*. *••* * 

CE DDNEFAEFEDEFVGSSATQAPEIQREGEPPVLKQKDDFEEEDFGVVEEEPEEAEKVREAD 
HP TSSSKNKDPITIVDVPAHLQNSWESYYLEILMVTGLLAYIMNYIIGKNKNSRLAQAWFNT 
10 *....*****....*.** .* .* *. **.***..*...*★. *. 

CE SDDAAPAQPLKFADVPAHFRSNWASYQVEGIVVT.IILIYMTNYLIGKTTNASIAQTIFDM 

HP HRELLESNFTLVGDDGTNKEATSTGKLNQENEHIYNLWCSGRVCCEGMLIQLRFLKRQDL 

* *★..*..****** * ** # *** * **** # 

CE CRPTLEEQFAWGDDGTTDLDKMIPSLKHDTDSTFSAWCTGRVNVNSLFLQMKMVKRQDV 
15 HP LNVLARMMRPVSDQVQIKVTMN-DEDMDTYVFAVGTRKALVRLQKEMQDLSEFCSDKPKS 
• • . *. * .*•. ** * *. .****..* . *** **..* *., mm 

CE VSRIMEMFTPSGDKMTIKASLETTNDTDPLIFAVGEKKIASKYFKEMLDLNS FASERKQA 
HP GAKYGLPDSLAILSEMGEVTDGMMDTKMVHFLTHYADKIESVHFSDQFSGPKIMQEEGQP 
**. .* .*....* ** .★.****.*** mm * m . 

20 CE AQQFNLPASWQVYADQNEWFSILDPGWSLLKKHEDAIEFIHISDQFTGPKPAEGESYT 

HP LKLPDTKRTLLFTF^A^PGSGNTYPKDMEALLPLMN^^VIYSIDKAKKFRLNREGKQKADKN 

.**...* * .* * *.*.* ****.*..*....* **... 

CE -RLPEAQRYMFVSLNLQYLG QDEESVMEILNLVFYLIDKARKMKLSKDAKVKAERR 

HP RARVEENFLKLTHVQRQEAAQSRREEKKRAEKERIMNEEDPEKQRRLEEAALRREQKKLE 
25 * ** ****** ^ ***** # * ^ *...*.*.***.*.***. ,* # ** * 

CE RKEFEDAFLKQTHQFRQEAAQARREEKTRERKQKLMDESDPERQKRLEAKELKREAKA — 
HP KKQMKMKQ I KVKAM 

* ** ** ^ ** * 

CE -KSPKMKQLKVK 

30 _ 



- equences o : t no p r eseni c D N A n as reveaiea t. n e ; egistration 
of sequences that shared a homology of 90% or more (for 
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example, Accession No. AA307 793) in ESTs, but, since they 
are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

5 

<HP02485> (SEQ ID Nos. 122, 132, and 142) 

Determination of the whole base sequence of the cDNA 
insert of clone HP024 85 obtained from cDNA library of human 
stomach cancer revealed the structure consisting of a 69-bp 

10 5 '-untranslated region, a 1005-bp ORF, and a 1672-bp 3'- 
untranslated region. The ORF codes for a protein consisting 
of 334 amino acid residues and there existed one putative 
transmembrane domain. Figure 42 depicts the 
hydrophobicity/hydrophilicity profile, obtained by the Kyte- 

15 Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 36 kDa that was almost identical with the molecular 
weight of 38,171 predicted from the ORF . When expressed in 
COS7 cells, an expression product of about 23 kDa was 

20 observed in the membrane fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the Caenorhabditis elegans 
hypothetical protein W01A11.2 (GenBank Accession No. U6 4 852). 

25 Table 28 shows the comparison between amino acid sequences 
of the human protein of the present invention (HP) and the C. 
elegans hypothetical protein W0 1 All. 2 (CE). Therein, the 
marks of - , * , and . represent a gap , an amino acid res idue 
identical with that of the protein of the present invention, 

- : . l e o e : i ..... .a v eni^on, . es p cc:;vei y . - n u o l : , ;.:o:eiiib 

snared a homology of 45.5% xn the entire region . 
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Table 28 



MVEFAPLFMPWERRLQTLAVLQFVFSFLALAEICT-V 

nir 

.***..**.***.****. *•* *. . * 
CE MRLRLSSISGKAKLPDKEICSSVSRILAPLLVPWKRRLETLAVMGFIFMWVILPIMDLWV 
HP GFIALLFTRFWLLTVLYAAWWYLDRDKPRQGGRHIQAIRCWTIWKYMKDYFPISLVKTAE 

* .*. * *.*....*• • * • ***' •***••*•***• 

CE PFHVLFNTRWWFLVPLYAVWFYYDFDTPKKASRRWNWARRHVAWKYFASYFPLRLIKTAD 
HP LDPSRNYIAGFHPHGVLAVGAFANLCTESTGFSSIFPGIRPHLMMLTLWFRAPFFRDYIM 
* **** **.* *..***.. ****..*.* *. * ** *.. • 

• • - " " 

CE LPADRNYIIGSHPHGMFSVGGFTAMSTNATGFEDKFPGIKSHIMTLNGQFYFPFRREFGI 
HP SAGLVTSEKESAAHILNRKGGGNLLGIIVGGAQEALDARPGSFTLLLRNRKGFVRLALTH 

* .. * *• .*••*** ***.*.*•• ** * ***** • **• 

CE MLGGIEVSKESLEYTLTKCGKGRACAIVIGGASEALEAHPNKNTLTLINRRGFCKYALKF 

HP GAPLVPIFSFGENDLFDQIPNSSGSWLRYIQNRLQKIMGISLPLFHGRGVF-QYSFGLIP 

** ***.„.*★****.„* *..**.** •* ' 

CE GADLVPMYNFGENDLYEQYENPKGSRLREVQEKIKDMFGLCPPLLRGRSLFNQYLIGLLP 
HP YRRPITTWGKPIEVQKTLHPSEEEVNQLHQRYIKELCNLFEAHKLKFNIPADQHLEFC 

.*.*.***.*..* * .* .*. *....**..* ..*.****..* •**•* ** * 

CE FRKPVTTVMGRPIRVTQTDEPTVEQIDELHAKYCDALYNLFEEYKHLHSIPPDTHLIFQ 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. D25664) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 



v 



Determination of the whole base sequence of the cDNA 
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10 



15 



20 



25 



insert of clone HP02798 obtained from cDNA library of human 
fibrosarcoma cell line HT-1080 revealed the structure 
consisting of a 31-bp 5 ' -untranslated region, a 804-bp ORF, 
and a 301-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 267 amino acid residues and there 
existed four putative transmembrane domains. Figure 43 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte-ooolittle method, of the present protein, m 
vitro translation resulted in formation of a translation 
product of 29 kDa that was almost identical with the 
molecular weight of 30,7/8 predicted from the ORF. when 
expressed in COS7 cells., an expression product of about 26 
kDa was observed in the membrane fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 
protein was similar to the human DHHC-containing cysteine- 
rich protein (GenBank Accession No. U90653). Table 29 shows 
the comparison between amino acid sequences of the human 
protein of the present invention (HP) and the human DHHC- 
containing cysteine-rich protein (DH) . Therein, the marks of 
-/ *, and . represent a gap, an amino acid residue identical 
with that of the protein of the present invention, and an 
amino acid residue similar to that of the protein of the 
present invention, respectively. The both proteins shared a 
homology of 35.0% in the intermediate region of 100 amino 
acid residues. The positions of seven cysteines were 
conserved between the two proteins. The protein of the 
present invention also had the DHHC (Asp-His-His-Cys ) 
sequence. 
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Table 29 



Hp MAP WALL S P GVL VRTG HT VLTWG I 

5 DH MYKMNICNKPSNKTAPEKSVWTAPAQPSGPSPELQGQRSRRNGWSWPPHPLQIVAWLLYL 

HP TLVLFLHDTELRQWEEQGELLLPLTFLLLVLGSLLLYLAVSLMDPGYVNVQPQP-QEELK 

**...*.. . ** . ** * 

DH FFAVIGFGILVPLLPHHWVPAGYACMGAIFAGHLWHLTAVSIDPADDNVRDKSYAGPLP 
HP EEQTAMVPPAIPLRRCRYCLVLQPLRARHCRECRRCVRRYDHHCPWMENCVGERNHPLFV 

n * * * * *..**..*..** .**** *..**★****..**. 

DH IFNP.SQHAHVTEDT.HCNLCNVDVSARSKHCSACNKCVCGFDHHCKWLNNCVGERNYRLFL 
HP VYLALQLWLLWGLYLAWSGLRFFQPWGLWLRSSGLLFATFLLLSLFSLVASLLLVSHLY 

.* . * 

DH HSVASALLGVLLLVLGGHICLRGVLCQPHASAHQPTL 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
20 example, Accession No. D79050) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
of these sequences codes for the same protein as the protein 
of the present invention. 

25 <HP10041> (SEQ ID Nos . 124, 134, and 144) 

Determination of the whole base sequence of the cDNA 
insert of clone HP10041 obtained from cDNA library of human 
osteosarcoma cell line Saos-2 revealed the structure 
consisting of a 12 -bp 5 ' -untranslated region, a 321-bp ORF, 

existed one nutaiive Transmembrane domain. Figure 4 4 depicts 



WO 00/05367 . i9 

107 

the hydrophobicity/hydrophilicity profile, obtain' I b a the 
Kyte-Doolittle method, of the present protein. In vitro 
translation resulted in formation of a translation product 
of 12 kDa that was almost identical with the molecular 
5 weight of 12,060 predicted from the ORF. When expressed in 
COS7 cells, an expression product of about 13 itDa was 
observed in the membrane fraction. 

The search of the protein data base using the am^no 
acid sequence of the present protein revealed that :.he 

10 protein was similar to the Caenorhabditis elegans 
hypothetical protein K10B2.4 (GenBank Accession No. U28730',. 
Table 3 0 shows the comparison between amino acid sequences 
of the human protein of the present invention (HP) and the C. 
elegans hypothetical protein K10B2.4 (CE) . Therein, the 

15 marks of *, and . represent a gap, an amino acid re^Jriue 
identical with that of the protein of the present invention, 
and an amino acid residue similar to that of the protein of 
the present invention, respectively. The both proteins 
shared a homology of 62.1% in the entire region. 



20 



Table 30 



HP MSTNNMSDPRRPNKVLRYKP PPSECNPALDDPTPDYMNLLGMIFSMCGLM- s ; :',WCA 

^ * * * * ★ * * * * ***************** 

25 CE MQQNGDPRRTNRIVRYKPLDSTANQQQAISEDPLPEYMNVLGMIFSMCGLM'- ^ WHS 

HP WVAVYCSFISFANSRSSEDTKQMMSSFMLSISAWMSYLQNPQPMTPPW 
*.*. ** ***** a * ^ * b * 9 ** m m ****** m ********** * *^ # *** 

CE WLALVCSCISFANTRTSDDAKQIVSSFMLSVSAWMSYLQNPSPIIPPW^/TiJ.O^ 



F' art he mi or* , * n<_- sear c : i o i t. r. e GenBa ?. 
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sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. H20098) in ESTs, but, since they are 
partial sequences, it can not be judged whether or not any 
5 of these sequences codes for the same protein as the protein 
of the present invention. 

<HP10246> (SEQ ID Nos. 125, 135, and 145) 

Determination of the whole base sequence of the cDNA 

10 insert of clone HP10246 obtained from cDNA library of human 
epidermoid carcinoma cell line KB revealed the structure 
consisting of a 110-bp 5 ' -untranslated region, a 675-bp ORF, 
and a 79-bp 3 ' -untranslated region. The ORF codes for a 
protein consisting of 224 amino acid residues and there 

15 existed five putative transmembrane domains. Figure 45 
depicts the hydrophobicity/hydrophilicity profile, obtained 
by the Kyte-Doolittle method, of the present protein. In 
vitro translation resulted in formation of a translation 
product of 23 kDa that was somewhat smaller than the 

20 molecular weight of 25,2 44 predicted from the ORF. When 
expressed in COS7 cells, an expression product of about 21 
kDa was observed in the membrane fraction. 

The search of the protein data base using the amino 
acid sequence of the present protein revealed that the 

25 protein was similar to the human putative seven 
transmembrane domain protein (GenBank Accession No. Y18007). 
Table 31 shows the comparison between amino acid sequences 
of the human protein of the present invention (HP) and the 
human putative seven transmembrane domain protein (TM) . 

— . .^.u^i:^x^a^ ^. l;. :naL ....... me piotciM < : t:ne 

present invention, and an amino acid residue similar to that 
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of the protein of the present invention, respectively. The 
both proteins shared a homology of 93 .3% in the entire 
region. 

5 Table 31 



HP MTLFHFGNCFALAYFPYFITYKCSGLSEYNAFWKCVQAGVTYLFVQLCKMLFLATFFPTW 
*********************** m 9 *********************************** 

TM MTLFHFGNCFAIAYFPYFITYKCTDLSEYNAFWKCVQAGVTYLFVQLCKMLFLATFFPTW 
10 HP EGGIYDFIGEFMKASVDVADLIGLNLVMSRNAGKGEYKIMVAALGWATAELIMSRCIPLW 

**************************** ************** ******** ********** 

TM EGGIYDFIGEFMKASVDVADLIGLNLVMSRNAGKGEYKIMVAALGWATAELIMSRCIPLW 

HP VGARGIEFDWKYI QMS IDSNI SLVH YIVAS AQVWMITRYDLYHTFRPAVLLLMFLSVYKA 
*********************** m ******************************.**** 

15 TM VGARGIEFDWKYI QMS ID SNISLGPY I VASAQVWMITRYDLYHTFRPAVLLLMFLRVYKA 

HP FVMETFVHLCSLGSWAALLARAWTGLLALSTLALYVAWNVHS 
****************.*.* .**.***. . . # ** # ********* 

TM FVMETFVHLCSLGSWAVLMAGVWKGLLVIRNLAMYVAWNVHS 



20 

Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 
example, Accession No. AA453931) in ESTs , but, since they 
25 are partial sequences, it can not be judged whether or not 
any of these sequences codes for the same protein as the 
protein of the present invention. 

<HP10392> (SEQ ID Nos. 126, 136, and 146) 

osteosarcoma cell line u-2 OS revealed the structure 
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_ ,, 9 and 149) 
<HP10531 > (SEQ » — 129 ' ^ ' e fcase s e,uence of the cDX* 
^termination of the *mo> ^ library of human 

insert o £ done HP 1 C53 1 oot^d ^ ^ structure 

osteoS arco,a eel! Une region , . 1035-oP <«. 

consisting of a S5-OP 5-unt «■ ^ ORr code s tor a 

and . 1092 - b p and rr. 

protein consisting - --_ ne domains. .- 

existed nve putative ««- ity pro£ ile, obtained 

depic ts the hydrophohicity/hyd P ^ prQtein . n 

by the -te-oooHtt. ^ of . traction 

vitE0 translation result 

product of hi 9 h -lecular weight ^ ^ base 

F urthermore, the search of ^ teglstratl on 

sconces o £ the present - ^ ^ or note (£ „r 

of seances that shared * ^ ^ they are 

e^pie, session »o. ^ or not any 

partiai sequences, « - J ^ prot ein as the protein 
of these sequences codes for 
ol the present invention. 

Mt> (SE0 » »os. »0. »0 ^ 5 °^ e of the «« 
oetermination of the *>> litrary of human 

in sert of clone HP10- o^J^ o£ a 

stomach cancer revealed the «x ^ ^ , . 

..^translated region, . » e ^ f<)r , pto tein consisting 
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by the Kyte-Doolittle method, of tt. present protein In 
Y 1fw , in formation of a translation 

vitro translation resulted in form 

product of high molecular weight. Application of the (-3,-1) 
rule, a method for predicting the cleavage site of the 
"retory signal sequence, allows to expect that the mature 
protein starts from serine at position 36. 

P k * 4-v,p> nrotein data base using the ammo 

The search of the protein u 

* *-v, 0 nresent protein revealed that the 
acid sequence of the present P roLlATH 
• to the Drosophila melanogaster GOLIATH 

protein was simile to the 

protein (SWISS-PROT Accession no. wu-^- ™-- " 
the comparison between amino acid sequences of the . human 
protein of the present invention (HP) and the D. 
Llanogaster GOLIATH protein (DM). Therein, the , «ar* ^ -. 

« aao an amino acid residue identical 
*, and . represent a gap, an 

_ •« the nresent invention, and an 
with that of the protein of the pres 

• to that of the protein of -he 

amino acid residue similar to tnax: ^ 

. or , Hve i v The intermediate region of 

nresent invention, respectively, me 

* ^ protein of the present invention 
169 amino acids of the protein * . 

„f di 4% with the N-terminal region of the 
shared a homology of 41.4% wiw 

D. melanogaster GOLIATH protein. 
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HP MGPPPGAGVSCRGGCGFSRLLAWCFLLALS 



p qap g s rg ae av wtaylnv s wrvp ht g vnr 
hp TVWELS e E =vvoqdsp I , E pv AG vi.vppdgp GM .»a COT h™ftwtvw G stv C vsw L m. I 

HP QRGGGCTFADKIHLAYERGASGAVIFNFPGTBNEVIPMSHPGAVDIVAIMIGNLKGTKIL 

MQLEKMQ IKGKTRN I AAVI T Y QNI GQDLS 
HP QS IQRGI QVTMVIE VGKK HGPWVNHYS IFFVS VSFF I ITAATVGYF I F YS ARJRLRNA 



.r.-ro T-Ttr_nnTT.rWT.TFYYlORFRYM 

DM LTLDKGYNVTISIIEGRRGVRTISSLNRT&vx.rv i »x— - 

HP PAQSRKQBQI'KADAKKAIGRIjQLRTLKQGDKEIGPDGDSCAVCIELyKPSDLVRILTCNH 

OHQ^OSSPMLC^K^MK^T^OKFSD-EKDLDSaCCAICIEAVKPTDIIRI.PCKH 
HPIPHKTCVDPW.LEHP.XCPMCKCD^KALGIEVDVEDGSVSMVPVSNEISHSASSHEEPH 

B „ EFHKKciDP^EHPTCP™ 
HP RSETASSGlfASVQGTDEPPLEEHVQSTNESLQLVNHEANSVAVDVIPHVDNPTFEEDETP 

^VPHGPQPI-CPLQAS^SSFAPSHVF^SRSPSSSVOMLAPLTVQPHPQQAASERGRHHS 
20 HP NQETAVREIKS 

DM APATMPHAITASHQVTDV 



Furthermore, the search of the GenBank using the base 
sequences of the present cDNA has revealed the registration 
of sequences that shared a homology of 90% or more (for 

,,.,,.,(,0 i n ests, but, since they 
example, Accession No. AA155685) m ESTS, 

... - ran not. be nudged whether or not 

nrotem oi the present invention. 
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R ccordingly, the - * oarcinostat ic agents 
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which act t antigens for prepare 

dit£ erentiatlo„ o ^ je Is e ^ mhs o£ the present 
antibodies agaxnst these pr ^ ^ 

mention can be ^ therapy . Fu rther,ore, 

gnosis ^ ti ;7;;; s £ :; r larae laie * — 

the DHAs can be utilized I intr oduccd to 
proteins. Cells into which ^ ^ of the 

express these proteins, can o£ „ ovel low . 
corresponding receptors and Uganda 

Secular pharmaceuticals «° ^ corrMpondill , 

represent invention also P- 

the nolvnucleotide sequences, 
to the poly Qf ^ genome thgt are 

••Corresponding genes are the ^ 

transcribed to produce the aiaa* 

arp derived anu nu. y 
polynucleotide sequences are ^ 

contiguous regions of the genorce nectary O( _ £oix 

expression of such genes. Correspond! g 

- limited to coding sequencer, j ^ d 
include but are not limited t ^ ^ 
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Herein. Such methods include tl„ prepare^,-. 
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through insertion, preferably followed by imprecise excision, 
of transposable elements (Plasterk, 1992, Bioessays 14(9): 
629-633; Zwaal et al., 1993, Proc. Natl. Acad. Sci. USA 
90(16): 7431-7435; Clark et al., 1994, Proc. Natl. Acad. Sci. 
5 USA 91(2): 719-722; all of which are incorporated by 
reference herein) , or through homologous recombination, 
preferably detected by positive/negative genetic selection 
strategies (Mansour et al., 1988, Nature 336: 348-352; U.S. 
Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 

10 5,614, 396; 5,616,491; and 5,679,523 ; all of which are 
incorporated by reference herein). These organisms with 
altered gens expression are preferably eukaryotes and more 
preferably are mammals. Such organisms are useful for the 
development of non-human models for the study of disorders 

15 involving the corresponding gene(s), and for the development 
of assay systems for the identification of molecules that 
interact with the protein product (s) of the corresponding 
gene(s). Where the protein of the present invention is 
membrane-bound (e.g., is a receptor), the present invention 

20 also provides for soluble forms of such protein. In such 
forms part or all of the intracellular and transmembrane 
domains of the protein are deleted such that the protein is 
fully secreted from the cell in which it is expressed. The 
intracellular and transmembrane domains of proteins of the 

25 invention can be identified in accordance with known 
' tm^^ for determination of such domains from sequence 

Proteins a!,w t : K - - — - ; r>r->. 
include proteins with amino acid sequence 

'M) at least 25% (more preferably at least 50%, and most 

preferably at least, 75%) of the length of a disclosed 

protein and have at least, 60% sequence identity (more 
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preferably, at least 75% identity; most preferably at least 
90% or 95% identity) with that disclosed protein, where 
sequence identity is determined by comparing the amino acid 
sequences of the proteins when aligned so as to maximize 
5 overlap and identity while minimizing sequence gaps. Also 
included in the present invention are proteins and protein 
fragments that contain a segment preferably comprising 8 or 
more (more preferably 20 or more, most preferably 3 0 or 
more) contiguous amino acids that shares at least 75% 
10 sequence identity (more preferably, at least 85% identity; 
most preferably at least 95% identity) with any such segment 
of any of the disclosed proteins . 

Species homologs of the disclosed polynucleotides and 
proteins are also provided by the present invention. As 

15 used herein, a "species homologue" is a protein or 
polynucleotide with a different species of origin from that 
of a given protein or polynucleotide, but with significant 
sequence similarity to the given protein or polynucleotide, 
as determined by those of skill in the art. Species 

20 homologs may be isolated and identified by making suitable 
probes or primers from the sequences provided herein and 
screening a suitable nucleic acid source from the desired 
species . 

The invention also encompasses allelic variants of the 
25 disclosed polynucleotides or proteins; that is, naturally- 
occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous, 

.ne . ;.ve:., . ..iv. 

30 sequences complementary to those of the polynucleotides 

disclosed herein - 

The present .lnventi.cn also includes polynucleotides 
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capable of hybridizing under reduced stringency conditions , 
more preferably stringent conditions, and most preferably 
highly stringent conditions, to polynucleotides described 
herein. Examples of stringency conditions are shown in the 
5 table 3 3 below: highly stringent conditions are those that 
are at least as stringent as, for example, conditions A-F; 
stringent conditions are at least as stringent as, for 
example, conditions G-L; and reduced stringency conditions 
are at least as stringent as ,- for example, conditions M-R. 
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Table 33 



Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)* 


Hybridization Temperature 
and Buffer* 


Wash 

Temperature 
and Buffed 


A 


DNA : DNA 


£50 


65^; lxSSC -or- 

42°C; lxSSC,50% formamide 


65t; 0.3xSSC 


B 


DNA : DNA 


<50 


T B *; lxSSC 


T B *; lxSSC 


C 


DNA : RNA 


£50 


67*C; lxSSC -or- 

45t; lxSSC,50% formamide 


67^; 0.3xSSC 


D 


DNA : RNA 


<50 


T D *; lxSSC 


T D *; lxSSC 


E 


RNA : RNA 


>50 


70°C; lxSSC -or- 

50°C; lxSSC,50% formamide 


70°C; 0.3xSSC 


F 


RNA : RNA 


<50 


T F *; lxSSC 


T F *; lxSSC 


G 


DNA : DNA 


>50 


65°C; 4xSSC -or- 

42°C; 4xSSC,50% formamide 


65°C; lxSSC 


H 


DNA : DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA : RNA 


>50 


67°C; 4xSSC -or- 

45°C; 4xSSC,50% formamide 


67°C; lxSSC 


J 


DNA : RNA 


<50 


T/; 4xSSC 


Tj*; 4xSSC 


K 


RNA : RNA 


>50 


70^C; 4xSSC -or- 

50X:-, 4xSSC,50% formamide 


67°C; lxSSC 


L 


RNA : RNA 


<50 


T L *; 2xSSC 


T L *; 2xSSC 


M 


DNA : DNA 


>50 


50°C; 4xSSC -or- 

40°C; 6xSSC,50% formamide 


50t; 2xSSC 


N 


DNA : DNA 


<50 


T N *; 6xSSC 


T N *; 6xSSC 


0 


DNA : RNA 


>50 


55°C; 4xSSC -or- 

A2°C; 6xSSC,50% formamide 


55°C; 2xSSC 


P 


DNA : RNA 


<50 


T P *; 6xSSC 


T P *; 6xSSC 


Q 


RNA : RNA 


>50 


60°C; 4xSSC -or- 

451C; 6xSSC,50% formamide 


60°C; 2xSSC 


R 


RNA : RNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



t : The hybrid length is that anticipated for the hybridized region(s) of the 
hybridizing polynucleotides. When hybridizing a polynucleotide to a target 
5 polynucleotide of unknown sequence, the hybrid length is assumed to be that of the 
hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, 

• • r- of th^ eolvnu^l^otid^s 

seven u"i':' " ' • 'L r i' c; ' : * :::: i. ra - •!!• • ::.e.- :e>e ;v 

T SSPK i 1>'vSSPK is u. if)M . NaCi. iUmM I\aa.i s , 4r ana . -01:.... . . . ; 

10 can be substituted for SSC (lxSSC is 0.1 5M NaCi and 15mM sodium citrate) m the 
hybridization and wash buffers; washes are performed for 15 minutes after 
hybridization is complete. 

*T.-. - T r Th" hybndi/.a: iun temperature for hybrids anticipated to be less than 
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50 base pairs in length should be 5-10°C less than the melting temperature f _ N < the 
hybrid, where T m is determined according to the following equations. I oi hybrids less 
than 18 base pairs in length, T m (°C)=2(#of A + T bases) + 4(# of G + C bases). For 
hybrids between 18 and 49 base pairs in length, T m (°C)=81.5 + 16.6(log 10 [Na 4 ]) + 0.41 
5 (°/oG+C) - (600/N), where N is the number of bases in the hybrid, and [Na + ] is the 

concentration of sodium ions in the hybridization buffer ([Na + ] for lxSSC- 0 ^65M). 

Additional examples of stringency conditions for 
polynucleotide hybridization are provided in Sambm^ir . t 

10 E.F. Fritsch, and T. Maniatis, 1989, Molecular ^ >ing* A 
Laboratory Manual, Cold Spring Harbor Laboratory Cold 
Spring Harbor, NY, chapters 9 and 11, and Current Protocols 
in Molecular Biology, 1995, F.M. Ausubel et al., "ohn 
Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, inco^por ^.ed 

15 herein by reference. 

Preferably, each such hybridizing polynucleotide a 
length that is at least 25% (more preferably at least !i0*. 
and most preferably at least 75%) of the length of the 
polynucleotide of the present invention tc winch it 

20 hybridizes, and has at least 60% sequence ident lt^ (mere 
preferably, at least 75% identity; most preferably . t lea i L 
90% or 95% identity) with the polynucleotide of the j.^pserit 
invention to which it hybridizes, where sequence-* jd=>n i : / x s 
determined by comparing the sequences of th- hybri^i^.i^q 

25 polynucleotides when aligned so as to maxi^iz. )vri .,. ^, lno 
identity while minimizing sequence gaps. 



